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AN ANALYSIS OF SOME FAILURE DATA 


D. J. Davis* 
The Rand Corporation 


This paper summarizes the raticnale and statistical tech- 
niques employed in the analysis of some failure data obtained 
from operations performed by machines and people. These 
data are compared to frequency distributions arising from 
either an exponential or a normal theory of failure. The agree- 
ment between theory and data is evaluated. 


INTRODUCTION 


HE probability distribution of time-to-failure in a system composed 
T of equipment and operating personnel is important in establishing 
planned maintenance schedules and in estimating the frequency of 
breakdown with the attendant operating time lost during repair. It is 
proposed here to examine some failure distributions together with a 
rationale of the mathematical models used to represent such distribu- 
tions. 

DEFINITIONS OF TERMS 


A system is defined as a complex of machines and operators which, in 
response to a given set of inputs or stimuli, is required to react in pre- 
scribed ways. Any reaction of the system which does not lie within the 
prescribed limits of response constitutes a failure and the absence of 
failure during a period of observation is termed success. Only systems 
which operate successfully at the beginning of the period of observation 
are examined. The period of observation will be described either by 
conventional time limits or, when more convenient, by the number of 
discrete operations of a system. The term reliability is used to denote 
the probability that a system does not fail during a period of observa- 
tion; reliability can then be estimated on the basis of observed data. 





* The advice, suggestions, and criticism of Bernice B. Brown, George W. Brown, T. E. Harris, 
A. M. Mood, M. P. Peisakoff, and John Walsh are responsible for whatever success I hav~ had in the 
research reported in this paper. Most especially I acknowledge my debt to W. J. Youden. 
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RATIONALE 


Consider a population of systems which are assumed to be similar 
both in characteristics and in the conditions of use. An individual sys- 
tem is first put to use at time t=0, at which time the system is in a non- 
failed condition. The system then operates successfully until ¢=T7', at 
which time the system fails and remains in a state of failure thenceforth 
(or the failed system is discarded). Thus, the variable ¢ represents the 
age of a system and T represents the age at failure of that particular 
system (both ¢ and T' have only non-negative values). 

In view of the similarity of the systems in the population and of their 
condition of use, it appears reasonable that the time-to-failure or life- 
time T of individuals of the population has a probability distribution. 
Let F(t) be the probability that a system selected randomly from the 
population has failed during the time interval 0 to ¢ (cumulative proba- 
bility), or 


Ft)=Pr{Tst}; O0<T<o. 
Also let f(t) be the time derivative of F(t) or 
fidt = Pr {ts TSt+dt}. 


Thus f(t) is the probability density function of failure time. It should 
be noticed that this represents the frequency with which members of 
the total population starting at t=0 will fail in the interval ¢ to ¢+d. 

Let us now introduce a new probability function which has been 
found useful in interpreting the physical causes of failure in terms of 
probability distributions. This function is termed the conditional den- 
sity function of failure probability with time and is defined as the in- 
stantaneous probability rate of failure at time ¢ conditional upon 
non-failure prior to time t. Let Z(t) be this conditional density function, 
from which 


Z(tdt = Pr {t S$ TSt+dt} giventhat {T2t}. 


The actuarial concept of “force of mortality” is precisely this condi- 
tional density function if a human being is considered as a system and 
death is defined as the failure of the system. 

The definition of the conditional density function, Z(t), leads to the 
following relation between it and the cumulative failure probability 
function, F(t), and the failure probability density function, f(t); 


(1) 
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Integrating, 


— f aeae = In [1 — F(t)] 


F(t) = 1 — exp | - f Z(a)de |. (2) 
Reliability then becomes 

1 — F(t) = exp | - f Z(e)de (3) 
and 

f) = 20 exp | — f z(2)ae). (4) 


The conditional density function is a convenient device for examining 
systems which are subject to failure from several independent causes 
characterized by Z,(#), Z2(t) - - - . The total conditional density func- 
tion for all causes of failure is then 


Z(t) = > At 
and 


F(t) = 1 — exp | - % f zterde | ‘ (5) 


In Figure 1, the cumulative probability of failure, (¢), the failure 
probability density function, f(t), and the conditional density function, 
Z(t), are compared diagrammatically for three different failure distri- 
butions: normal, exponential, and human mortality.! The first two dis- 
tributions are those which have been found applicable to the systems 
here considered; the human mortality distribution is introduced for 
illustrative purposes. The normal theory of failure is so named since the 
familiar normal probability density function, 


10 - ae [-3()} 


1 Mortality data obtained from [1]. 
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CUMULATIVE PROBABILITY 


FAILURE PROBABILITY 


(u and o are the population mean and standard deviation of time to fail- 
ure respectively) is assumed to describe failure distributions of some 
systems. Since ¢ is non-negative, the normal theory is an acceptable 


CONDITIONAL DENSITY 
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Fia@urE 1. Cumulative probability, probability density, and conditional density 
functions compared for three theories of failure. 


description of failure only if /° f(t)dt is a negligible quantity, say 
u>3c. If, however, u<3c, a truncated normal distribution could be 
used. Let f(t) be the normal probability density function and f’(t) be 
the truncated distribution. Then the truncated function is defined as: 
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The exponential theory of failure will be defined later. 

A population of systems which follows a normal theory of failure 
displays a conditional density function which has zero value in the 
early phases but increases at an accelerated rate throughout its life. 
The shape of the conditional density function qualitatively agrees with 
our intuitive knowledge of failure for such common objects as shoes or 
automobiles: a new system is reliable, an old one unreliable. 

Preventive maintenance procedures, by which parts are replaced 
after a fixed lifetime or at a given state of wear, are justified by a normal 
theory of failure. Literature on mechanical wear indicates that the time 
rate of wear increases with the cumulative amount of wear. If the con- 
ditional density function of failure of a system is assumed to increase 
with increased cumulative wear of some of the system components, 
then the replacement of worn components at a specified degree of wear 
will decrease the conditional density failure function for such systems. 
Determining an optimum preventive maintenance schedule requires an 
economic balance of the cost of component replacement versus the cost 
of system failure. 

The conditional density function for human mortality is similar in 

general characteristics to that of the normal theory except that in early 
life human mortality exhibits a non-zero conditional density. This sug- 
gests the rationale that in youth, humans are subjected to a small 
death-hazard rate (force of mortality), but as they age they become 
increasingly weaker and, therefore, subject to an increasing death- 
hazard rate. 
With mechanical systems, failure being an undesirable occurrence, 
replacement and preventive maintenance techniques will generally be 
instituted at a time when, from observation, the systems tend to be- 
come prone to failure. Therefore, an exponential theory of failure is 
proposed under which it is assumed that the conditional density func- 
tion for systems, in which failure is highly undesirable, is a constant 
(see Fig. 1). Under this theory Z(t) =K and equation (2) becomes: 


F(t) = 1 — e-*! (6) 
and equation (4) becomes 


S(t) = Ke-*'. (7) 
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It can be demonstrated that K is the reciprocal of the mean time to 
failure, r, so that equation (7) can be rewritten 


1 
f(®) =—e-"", (7a) 


It is pertinent to note that systems which are governed by the expo- 
nential theory of failure, produce a Poisson distribution of number of 
failures during equal intervals of time if systems which have failed are 
instantly replaced. 

Conceptually, the mechanism of failure may be described in terms 
of the primitives environment, failure resistance, and complexity. In 
usual operations, a system will be subjected to an environment of vary- 
ing severity which may be described by a statistical distribution. A 
population of systems will exhibit a distribution of failure resistances 
among the individual systems, and individual systems will most likely 
show a change in failure resistance as the result of the continued im- 
pingement of the environment upon them. It is impliec! that an instance 
of the environmental severity exceeding the failure resistance of a sys- 
tem at a particular time constitutes failure at that time. Complexity, 
although constant for a particular type of system, is of use in comparing 
different types of systems. It may be thought of as the number of ways 
in which the system can fail. 


DATA EXAMINED 


The systems examined were chosen on the basis of availability of 
data, and to obtain as great differences in the system characteristics as 
possible. Data already in existence were collected and analyzed, for the 
most part, since the generation of new data tends to be expensive and 
time consuming. The selection of systems having widely different char- 
acteristics gives a broad understanding of failure phenomena in general. 
Most systems fail from a combination of human and mechanical causes. 
Some data on failures caused by the human aspect alone are examined 
as well as those arising from a mixture of human and mechanic: 
causes. The sources of data are not presented here since most of them 
came from private commercial records under the condition that the 
source not be revealed. 

Observed failure data are compared to predictions derived from the 
previously stated failure theories: the exponential theory which is char- 
acterized by the probability density function of equation (7a) and the 
normal theory which is characterized by the previously stated normal 
failure probability density function. The constants used in theoretical 
predictions are estimated from the failure data. 
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The analysis of the failure data on the following systems is presented 
in the appendix: 


I. Bank Statement and Ledger Error 
II. Student Typing Errors 
III. Lost or Forgotten Identification Badges and Cards 
IV. Payroll Check Errors 
V. Linotype Machine Failure 
VI. Powered Hand Tool Failure 
VII. Automatic Calculating Machine Failure 
VIII. Electric Motor Centrifugal Switch Failure 
IX. Radar Set Component Failure 
Vacuum Tube 
Vacuum Tube 
Vacuum Tube 
Vacuum Tube 
Resistor 
Resistor 
Resistor 
Network 
Crystal 
Knob 
Combination of Transformers 
Combination of Relays 
M. Combination of Capacitors 
N. Combination of Various Components 
X. Commercial Aircraft Radio Tube Failure 
XI. Dry Cell Failure 
XII. Incandescent Lamp Failure 
XIII. Bus Motor Failure 


SMrAS ROAR SOW 


STATISTICAL TECHNIQUES USED IN ANALYSIS OF DATA 
I. Method of Testing Agreement of Data with a Theory 


The chi-square test [2, Chap. IV] is employed in comparing the ob- 
served distribution of occurrences with the distribution which is pre- 
dicted by the theory. It establishes a method of evaluating the proba- 
bility that such variations as are observed would arise if the theory is 
correct. It thus gives a quantitative evaluation of the agreement be- 
tween observed data and theory. Acceptable agreement is taken to 
indicate that the assumed theoretical distribution gives a reasonable 
description of the actual occurrences. 
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II. Testing Agreement of Data with the Exponential Theory of Failure 
(Z(t) =K) 


A. Large samples of lifetimes 


When the data consist of a large number of times-between-failure or 
lifetimes, ostensibly from the same population, convenient intervals of 
lifetime are chosen and the frequency of observed failures is found for 
each interval. The corresponding theoretical frequencies are then calcu- 
lated from the density function 


eT (7a) 


Nf) = 


si] = 


where 
N =the total number of failures in the sample 
T =sample estimate of average time to failure. 
The expected number of failures X in the interval of time t; to & is 


ta _}*2 

X= Nf(it)dt = — Newt] R (8) 
t ti 

Observed and theoretical frequencies are then compared by the chi- 

square test. 


B. Frequency-of-occurrence data, using the Poisson distribution 


When the data consist of the number of failures in each of a large 
number of equal time intervals, the Poisson distribution is employed. 
Under the above conditions, it can be shown that for the exponential 
theory the mechanisms are subject to the same expectation of failure 
during each time interval and that a Poisson distribution describes the 
number of failures per interval. The Poisson distribution gives the 
probability of zx failures as 


z 


X 
P(x) = —e> 
x! 


where A is the expected number per interval, and P(x) is the proba- 
bility of precisely x failures in an interval. The expected number, X, of 
intervals exhibiting precisely x failures is X = NP(zx), N being the total 
number of intervals in the sample. 

The number of intervals observed to exhibit each number of failures 
is compared with that predicted by the Poisson distribution, using the 
chi-square test. For this, sample estimate, r, of the population parame- 
ter is used, If the data show satisfactory agreement, it seems reason- 
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able to suppose that the causes of failure operate like an exponential 
failure distribution. 


C. Manipulation of a series of lifetimes to form a Poisson distribution 


A series of observed system lifetimes can be arranged in a time se- 
quence and the sequence divided into equal time intervals. The se- 
quence then represents a series of failures plotted on the time scale so 
that the times between successive failures correspond to the lifetimes 
of the successive constituents of the sample. If the exponential theory 
of failure is assumed, each interval has the same expectancy of failure. 
By this “milepost” manipulation of the data, the observed frequency 
distribution of the number of intervals exhibiting no failures, 1 failure, 
2 failures, etc., can be compared to the Poisson distribution which the 
assumed theory implies. 

This technique is used optionally in place of Method II A above but 
is particularly useful when the data consist of a moderate number of 
occurrences from each of several similar systems but with different 
estimated mean times-to-failure for each type. 

The interval length for each system is arbitrarily chosen as the esti- 
mate of mean time to failure.? Thus each interval has an expectancy of 
one occurrence. Although the several types of systems composing the 
statistic may have different estimated mean times-to-failure, they are 
assumed to exhibit similarly shaped distribution functions and, conse- 
quently each interval of each system has the same failure expectancy. 
Therefore the frequencies of the several types of systems can be com- 
bined toform a homogeneous distribution of a large number of observa- 
tions. The combined observed distribution is compared with the theo- 
retical Poisson distribution by means of the chi-square test. 


D. Small samples of frequency-of-occurrence data 


When the data consist of the number of failures in a few equal time 
intervals for numerous systems having different failure rates, an index 
of dispersion, ak i— X)?/X, is calculated for each system, X; being 
the observed number of failures in the 7** interval and X the mean 
number of failures per interval for that system. For Poisson distribu- 
tions (from the exponential theory) this index of dispersion has a chi- 
square distribution (see page 58 of Ref. 2). The observed frequency 
distribution of the ratios is compared with the theoretical distribution 
by the chi-square test. 





2 The interval length could have been chosen at any multiple or fraction of the mean time-to-failure 
and this variation of procedures has been tried many times with parallel results. Using the mean time-to- 
failure for the interval simplifies calculations and will be followed in this discussion. 














122 AMERICAN STATISTICAL ASSOCIATION JOURNAL, JUNE 1952 


III. Testing Agreement of Data with the Normal Theory of Failure 


Visual comparison of the observed failure-frequency distribution 
curve with that predicted by the theory gives a fair index of agreement. 
A quantitative evaluation of agreement is obtained by comparing ob- 
served and theoretical frequencies within equal and successive time 
intervals and applying the chi-square test. The departure of the data 
from the normal theory can be further examined by computing meas- 
ures of skewness and kurtosis. 


CONCLUSIONS 


The exponential theory of failure appears to describe most of the 
systems examined here. Those systems which exhibit reasonable agree- 
ment with this failure theory are characterized by predominance of 
human errors as the cause or a careful and well developed operating 
technique for minimizing failure. Systems which are subject to a wide 
range of environmental severity also appear to follow this pattern. 

The normal theory of failure appears to apply to systems which ex- 
hibit small variation in failure resistance among the individuals within 
a population and which are subject to small variations in environmental 
severity. Further, the failure resistance of the mechanism deteriorates 
with time and operational procedure requires that each item be used 
until ultimate failure. In the appendix the several samples of data are 
independently compared to an appropriate theory with varying good- 
ness of fit. It is probably appropriate to examine the set of samples as a 
whole to see what statistical inference can be drawn from a larger 
sample composed of the many small samples. 




















Observed Expected (O-—E)? 
Range of p(x?) No. of No. of —_—-—— 

Examples Examples E 
0 to 0.14 7 42/7 1.72 
0.15 to 0.28 6 42/7 0.69 
0.29 to 0.42 6 42/7 0.69 
0.43 to 0.57 4 42/7 0.02 
0.58 to 0.71 1 42/7 2.52 
0.72 to 0.85 3 42/7 0.39 
0.86 to 1.0 3 42/7 0.39 

Total 30 30 6.42 =x? 








df.=6  P+0.39 
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Consider the 30 sets of data which were compared to the exponential 
theory via 30 chi-square tests. Although none of the thirty tests shed 
serious doubt on the theory, it is possible that the values of chi-square 
tend to be larger than expected (poor fit). The following table compares 
the frequency of occurrence of chi-square probabilities from the 30 sets 
of data, with the expected distribution of chi-square probabilities, P (x?): 
For those who prefer to apply the chi-square test to classes containing 
an expectation of more than 5 instances, the comparison of the above 
data with the theory via four equal classes, each having an expectation 
of 7} instances, produces a chi-square of 5.47 and a P of 0.15. Alter- 
nately, the fit of the thirty sets of data may be examined by summing 
the thirty values of x? and summing the 30 associated values of degrees 
of freedom, 


> x? = 174.69, > df. = 147, P = 0.06. 


The above tests of the entire set of data indicate no statistically sig- 
nificant departure from the exponential theory of failure. However, the 
dominance of the number of poorer fits over the number of better fits, 
leads one to suspect that the lack of disagreement between theory and 
data may well have resulted from the relatively small size of the 
samples. 

From an examination of the causes of failure for these data, it could 
have been predicted that the oversimplified exponential hypothesis of 
failure would not precisely describe the observed phenomena. However, 
the discrepancies between the theory and the data (though they may 
be significant in a statistical sense) are small enough that the expo- 
nential theory may be regarded as a useful approximation of certain 
classes of actual failure distributions. 


APPENDIX 
ANALYSIS OF DATA ON OBSERVED SYSTEMS 


I. Bank Statement and Ledger Error 
Data 


Number of correct entries between errors plus one, listed in chrono- 
logical order of occurrence over a period of 10 days. The accounts were 
kept by five clerks. Each number of correct entries between failures is 
considered as a trial of the lifetime-to-failure of a “machine.” The effect 
of fatigue on accuracy is disregarded as is the chance that the error was 
a failure of the recording machine rather than of the operator; discus- 
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sion of working conditions validates these assumptions. During the 
short period observed, the bookkeepers, being at a rather static period 
of improvement, would not be expected to change their failure rate 






































appreciably. 
Observed data 
Ist Clerk 2nd Clerk 3rd Clerk 4th Clerk 5th Clerk 
734 451 726 234 50 149 26 61 
121 3 883 133 3 74 12 305 
404 1,116 142 14 115 170 190 267 
646 1,143 196 233 65 2 170 + # 573 
1,072 447 14 46 295 129 201 282 
148 630 1,905 1,327 235 3 481 414 
312 37 456 259 781 65 157 48 
773 2,031 2,565 115 343 44 165 150 
43 1,786 610 369 49 204 75 212 
1,102 659 1,263 563 551 333 334 # 189 
111 151 347 286 105 60 671 47 
641 210 881 145 19 ll 10 95 
754 1,426 1,214 69 127 60 290 57 
598 72 248 1,822 77 20 321 160 
86 699 195 686 82 608 6 99 
2,138 426 548 183 24 19 231 139 
150 1,040 234 146 64 64 204 18 
1,047 277 1,096 268 941 113 45 609 
907 72 530 115 1,153 413 94 64 
165 1,286 338 494 147 75 176 211 
166 235 356 215 67 22 488 35 
6 625 217 539 878 403 76 249 
94 493 195 467 339 299 233 81 
1,023 2 77 4 151 396 359 294 
903 756 392 37 545 6 33 98 
355 1,460 3,114 453 565 156 333 418 
303 629 651 330 662 33 
1,378 
202 
343 
1,266 
Total Entries 17,991 17 ,533 18,742 18,273 15 ,446 
Total Errors 31 26 26 54 81 
Entries per 
Error 580 674 721 338 191 
Analysis 


Using the mean number of entries per error as a “milepost” for each 
clerk, the observed frequency Cistribution of errors was compared with 
the theoretical frequency calculated under the assumption that a Pois- 


son distribution exists (exponential theory of error). 
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Result 














Freq. of Occurrence Observed Expected 
per Milepost Freq. Freq. 
0 85 80.2 
1 76 80.2 
2 39 40.1 
3 11 13.4 
4 or more rf 4.1 
Total 218 218.0 











df.=3  »%=3.1 P+0.40 


The value of P corresponding to the computed value of x? and to the 
number of degrees of freedom is obtained from appropriate tables [2, 
Table III]. The number of degrees of freedom is defined as the number 
of discrete comparisons of observed and expected (theoretical) fre- 
quencies, diminished by the number of parameters of the expected dis- 
tribution which were obtained from the sample. Thus, in the above 
comparison, x? was computed as the sum of § comparisons of observed 
and expected frequencies (for 0, 1, 2, 3, and 4 or more occurrences per 
milepost). However, the expected frequencies were based (1) on the 
total frequency of the sample (i.e. 218) and (2) on the sample mean 
expectation per milepost (i.e. one). Therefore, the degrees of freedom 
for this value of x? is 5—2 or 8. 


II. Student Typing Errors 
Data 


Number of typing strokes between errors for nine typists during 
definite intervals throughout one week, for which period their accuracy 
should remain rather static. 


Analysis 


Using the mean number of strokes per error for each typist as a mile- 
post, the observed frequency distribution was compared with the theo- 
retical frequency as determined from the Poisson distribution (expo- 
nential theory). 
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Result 
Freq. of Occurrence Observed Expected 
per Milepost Freq. Freq. 
0 135 135.0 
1 140 135.0 
2 58 67.5 
3 29 22.5 
4 or more 5 7.0 
Total 367 367.0 











a.f.=3 x7 =4.0 P=0.26 


III. Lost or Forgotten Identification Badges and Cards 
Data 


A. Number of identification badges and cards lost or forgotten each 
day, Tuesday through Friday, for a period of 8 weeks at a factory. The 
Monday record was excluded as the forgetting rate was consistently 
higher than on other days. The records of Saturdays and Sundays were 
also excluded, since an unknown fraction of the employees was present 
on these days. The number of employees decreased about 15 per cent 
during the period. The number of badges and ID cards lost or for- 
gotten was adjusted each week by the corresponding decrease in total 
number of employees. 

B. Serial order and frequency with which employees appeared at 
three different entrances of the plant without identification badges or 
cards on three successive Mondays. 








Gate 20 Gate 60 Engineering 





268 32 85 | 22 205 260 65 | 29 81 
146 400 24; 110 70 579 460] 23 162 
237 221 970} 464 150 123 6 | 177 93 
2 62 73 | 41 12 28 210 14 51 
56 99 163 | 148 285 5 14 10 75 
27 9 148; 60 24 220 190 6 17 





23 
Total Employees Ob- 
served 3047 3751 761 
Total Errors 18 24 13 





Employees per Error 169 156 58.5 























_— = a a 
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Analysis 


A. The data for 33 days were compared with the exponential theo- 
retical frequencies computed from the Poisson distribution. The ob- 
served number of badges or ID cards lost or forgotten ranged from 0 to 
40 per day. The expected number of days exhibiting each integral value 
of number forgotten were computed but combined into arbitrary classes 
giving approximately equal expected number of days in each class. 

B. The mean length of interval between failures at each gate was 
used as a milepost and the observed frequency was compared with the 


theoretical frequency as determined from the Poisson distribution. 













































































Result 

A 
Number Number of Days Number Number of Days 
of Badges of Badges 
Forgotten Ob- Lost Ob- 

Daily served Expected Daily served Expected 

0 to 23 9 6.96 0 to 3 11 8.20 
24 to 26 6 6.74 4 7 5.65 
27 to 29 6 7.35 5 2 5.78 
30 to 32 7 5.88 6 2 4.94 
33 or more 5 6.07 7 or more 11 8.43 
Total 33 33.00 Total | 33 33.00 
d.f.=3 x? = 1.33 P=0.72 d.f.=3 x? =6.29 P=0.i0 
Number of Number of Days Number Number of Days 

of ID of ID 

Forgotten Ob- Lost Ob- 

Daily served Expected Daily served Expected 
Oto 7 6 5.70 0 4 5.36 
8& 9 8 7.06 1 12 9.73 

10 & 11 8 7.87 2 9 8.85 
12 & 13 1 6.14 3 or more 8 9.06 
14 or more 10 6.23 

Total | 33 | 33.00 Total 33 33.00 
a.f.=3 x? =6.83 P=0.08 d.f.=2 x? =1.01 P=0.60 


























No. of Fail 
per Milepost. Observed Expected 
0 21 20.2 
1 20 20.2 
: 10 10.1 
3 or more 4 4.5 











d.f.=2 x? =0.1 P=0.95 


IV. Payroll Check Errors 
Data 


Serial numbers of payroll checks spoiled in tabulating either because 
of errors in the check-writing machines or because of errors made by 
personnel in sorting and feeding cards into the machines. The first 
examination of the data indicated that check spoilage could not possi- 
bly be an exponential occurrence, for far too many checks were spoiled 
close together (too many short lives). Examination of all the adjacent 
or close-together failures showed that groups of them were from the 
same cause. The machine failed only once but produced several errors. 
Each group of errors was then counted as a single occurrence and the 
period of faulty operation was removed from the record. Further in- 
vestigation of the causes of failure in the remaining spoiled checks re- 
vealed no tendency that cou!d not occur by the theory. Although 144 
spoiled checks were observed, only 99 cases of failure were judged to 
have occurred. 


Analysis 
The mean length of interval between voided checks was used as a 


milepost and the observed frequency compared with the theoretical as 
determined from the Poisson distribution (exponential theory). 


Result 














Freq. of Occurrence Observed Expected 
per Milepost Freq. Freq. 
0 32 36.42 
1 44 36.42 
2 16 18.21 
3 or more 7 7.95 
Total 99 99.00 











x? =2.5 P=0.29 

















V. Linotype Machine Failure 


Data 


Number of failures per day for six successive days for 22 linotype 


machines. 


Analysis 


The ratio, ).(X —X)?/X of a series of Poisson distributions exhibits 
a x? distribution. The ratio was calculated for each machine, X being 
the number of failures each day and X being the mean daily failure 
rate, and the distribution of these values compared with the expected 
x? distribution for 5 degrees of freedom. 


























Result 
Tabular Values Pp bili Observed Expected 

of x? robability Freq. Freq 
0 -3.00 P>0.70 4.0 6.6 
3 .01-4.35 0.70>P>0.50 7.0 4.4 
4.36-6 .06 0.50>P>0.30 5.0 4.4 
6.07 or more P <0.30 6.0 6.6 
Total 22.0 22.0 

d.f.=3 x? =2.7 P=0.45 


VI. Powered Hand Tool Failure 


Data 


Data on dates of repair calls on 15 hand electric drill motors. 


Analysis 


Mean number of days between failures for each drill was used as a 
milepost and frequency distribution compared with the theoretical 
Poisson distribution (exponential theory). 














Result 
Number of Failures Observed Expected 
per Milepost Freq. Freq. 
0 66 60.4 
1 49 60.4 
2 36 30.2 
3 or more 13 13.0 
Total 164 164.0 











df.=2 x2=3.8 


P=+0.16 
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VII. Automatic Calculating Machine Failure 
Data 

Number of service calls on each of 79 calculators during 1946. 
Analysis 
The observed frequency distribution of service calls was compared 


with expected frequency derived from the Poisson distribution (expo- 
nential theory). 














Result 
Number of Service Observed Expected 
Calis Freq. Freq. 
0 36 31.4 
1 24 29.0 
2 10 13.4 
3 or more 9 5.2 
Total 79 79.0 











df.=2  x7=5.2  P+0.08 


VIII. Electric Motor Centrifugal Switch Failure 
Data 
Number of operating cycles to failure of 17 electric motor centrifugal 
switches. 
Analysis 
The mean number of cycles to failure was used as a milepost and the 


observed number of failures per milepost compared with the theoretical 
as determined from the Poisson distribution (exponential theory). 




















Result 
Insufficient data renders agreement with theory inconclusive. 
Number of Failures Observed Expected 
per Milepost Freq. Freq. 
0 4 6.25 
1 9 6.25 
2 or more 4 4.50 
Total 17 17.00 





x? =2.1 P=0.15 
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IX. Radar Set Component Failure 
Data 


Lifetime in hours of tubes, resistors, networks, crystals, knobs, trans- 
formers, relays, and capacitors used in aircraft radar sets. 


Analysis 


Comparison of observed distribution with the exponential theoretical 
distribution defined by 


Nf(t) = (N/The-!? 


where N is the number of failures and T is the sample estimate of mean 
time to failure, except as noted. 

Where a sufficient number of lives was available, the data consisted 
of the lives of one model of component, used in the same unit of like 
radar sets. For components having too few lives recorded to be handled 
individually, several components of the same type but of different 
models and used in different radar units and sets were combined into a 
single distribution. Finally, a combined distribution was formed from 
different types of components used in different radar units and sets. 
These last consisted of small samples of from 5 to 20 lives for each 
model of component. 

It is a false assumption that a distribution formed from two or more 
failure distributions having different mean times-to-failure is homoge- 
neous. It can be shown that mixing two or more exponential distribu- 
tions with different mean times-to-failure produces a hybrid distribu- 
tion differing somewhat from exponential failure. However, even in the 
mixed data a tolerable agreement with the theory was obtained. 














Result 
A 
V7 TRANSMITTER TUBE 
h Observed Expected 
¢ (hr.) Freq. Freq. 
0- 25 109 101.9 
25- 50 42 46.2 
50— 75 17 21.6 
75-100 7 9.8 
100-up 13 8.5 
Total 188 188.0 











d.f.=3 x? = 4.93 P=+0.18 
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The distribution chart for this tube (see Fig. 2) is disturbing in that 
the number of failures during the first three five-hour periods increases 
until a maximum failure rate is reached during the period of from 15 to 
20 hours of operation. The theory predicts that the failure rate should 
decrease with increasing operating time. It is believed, however, that 
the apparent discrepancy between observed data and theory is caused 
by the method of reporting fligh} hours as failure time. A component 
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Figure 2. Distribution of failure frequency with 
operating time—type V7 vacuum tube. 


which was installed new at the beginning of a ten-hour flight and which 
failed after one hour of flight would be reported as failed at the end of 
the flight, indicating ten hours to failure rather than one hour. Thus 
the lives recorded in the distribution chart are longer than the true 
lives. Moving the lives from the recorded position on the distribution 
chart to the true position would increase the number of occurrences in 
the first fifteen hours to approach the quantity expected. 





The V-7 vacuum tube had a short mean life and was the only com- 
ponent displaying this low failure rate at short life. This is understand- 
able since, in longer life components, the shift in lives by inaccurate re- 
porting is smaller than the first time interval into which failures were 
collected. The shift in the distribution then is so slight as to be unde- 
tectable. 


Vv600 INDICATOR TUBE 








Observed 
Freq. 


_ Expected 
Freq. 











29 
22 
12 
10 
10 

9 

8 


28.8 
20.0 
14.8 
10.2 
12. 





100 








d.f.=5 


x? =2.48 


P=0.78 


V805 TRANSMITTER TUBE 








t (hr.) 


Observed 
Freq. 


Expected 
Freq. 





0- 40 
40— 80 
80-120 

120-160 
160-200 
200-240 
240-280 
280-320 
320-360 
360—400 
400-440 
440-480 
480-520 
520-560 
560-600 
600-700 
700--800 
800—up 


166 
151 
132 
98 
73 
45 
53 
40 
23 
26 
24 
9 

9 

8 

9 
17 
8 
12 


180.5 
145. 
115. 
92 
73 
59 
47. 
37 
30. 
24 
19 
16. 
11 
9 
8. 
13 
8. 
10. 


CNMNODAOWWNONWOUROSO 





Total 





903 





902. 


i°,2) 





d.f.=16 


x?=17.8 


P=0.34 
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See Fig. 3. 
D 
TRANSMITTER TUBE 4 
Observed Expected : 
¢ (hr.) Freq. Freq. 
0—- 50 100 94.7 
50-100 68 68.1 
100-150 48 50.2 
150-200 31 34.0 
200-300 42 43.8 
300—400 21 22.2 
400-600 27 24.0 
Total 337 337 .0 
a.f.=5 =1.13 P=0.95 
120 
100 
mia 
> «| WI 
2 
La 
re 
= 40 X 
ne 
20 to 
0 it = 
0 100 200 300 400 500 600 700 800 900 1000 
OPERATING HOURS TO FAILURE 
FicureE 3. Distribution of failure frequency with operating time—type V805 





vacuum tube used in transmitter. 
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E 

R105 RESISTOR USED IN PE218 CONVERTER 
— h Observed Expected 

¢ (br.) Freq. Freq. 

on 0- 20 92 86.8 

20- 40 60 63.1 

40— 60 49 45 .6 

60— 80 35 33.3 

80-100 16 24.1 

100-140 26 30.4 

140-180 22 15.9 

180—up 17 17.8 

Total 317 317.0 

d.f.=6 x? =5.13 P=0.50 
F 

R105 RESISTOR USED IN PE218D CONVERTER 
h Observed Expected 

¢ (hr.) Freq. Freq. 

0- 40 124 115.2 

40— 80 84 85.5 

80-120 57 67.1 

120-160 51 51.5 

160-200 44 38.4 

200-280 57 53.4 

280-400 31 41.3 

400-560 23 21.9 

560-up 15 11.7 

Total 486 486.0 











d.f.=7 x? =6.74 P=0.47 








R348 RESISTOR USED IN RECEIVER 















































h Observed Expected 
¢ (hr.) Freq. Freq. 
0- 50 22 25.5 
50-100 16 18.5 
100-150 15 13.4 
150-200 12 9.7 
200-300 18 12.2 
300-up 10 13.7 
Total 93 93.0 
d.f.=4 x? =5.39 



















































































H 
Z303 NETWORK USED IN RF UNIT 
h Observed Expected 
é (hr.) Freq. Freq. 
0—- 20 14 16.4 
20— 40 13 12.9 
40— 60 12 10.6 
60— 80 8 7.9 
80-100 7 6.2 
100-150 11 10.5 
150-up 12 12.5 
Total 77 77.0 
a.f.=5 x? = 0.67 
I 
Y301 CRYSTAL USED IN RF UNIT 
Observed Expected 
¢ (hr.) Freq. Freq. 
0—- 50 29 42.2 
50-100 43 32.1 
100-150 28 26.3 
150-200 23 20.8 
200-300 28 28.7 
300-400 14 18.2 
400-600 22 18.2 
600-up ll 11.5 
Total 198 198.0 








d.f.=6 





x? = 9.97 
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_ J 
E103 KNOB USED IN PE218D CONVERTER 
_ 7 Observed Expected 
¢ (br.) Freq. Freq. 
0- 20 60 49.4 
20- 40 56 47.2 
40— 60 40 38.4 
_ 60-100 47 66.9 
100-140 46 50.2 
_ 140-180 29 40.6 
180-220 41 31.9 
220-260 25 24.0 
260-300 20 19.7 
300-400 29 31.5 
400-500 22 17.5 
= 500-up 22 19.7 
Total 437 437 .0 
d.f.=10 x? =17.31 P=0.07 
K 
_ COMBINATION OF TRANSFORMERS 
‘ Observed Expected 
t (hr.) Freq. Freq. 
0— 50 36 33 .6 
50-100 19 27.9 
. 100-150 23 21.0 
150-200 20 17.4 
200-300 33 24.1 
- 300—400 17 14.9 
400-up 16 25.1 
Total 164 164.0 











a.f.=5 x? = 10.45 P=0.07 













COMBINATION OF RELAYS 
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h Observed Expected 
¢ (br.) Freq. Freq. 
0- 50 61 55.1 
50-100 53 41.0 
100-150 26 28.7 
150-200 13 21.3 
200-300 17 26.3 
300-400 11 13.7 
400-up 20 14.9 
Total 201 201.0 

d.f.=5 x? = 13.18 P=0.02 


COMBINATION OF CAPACITORS 










































h Observed Expected 
¢ (hr.) Freq. Freq. 
0— 50 41 41.2 
50-100 33 31.6 
100-200 42 46.4 
200-300 25 29.7 
300—400 28 18.9 
400-600 21 19.7 
600-up 11 13.5 
Total 201 201.0 

a.f.=5 x” =5.66 P=0.35 
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N 
COMBINATION OF VARIOUS COMPONENTS 
h Observed Expected 
¢ (hr.) Freq. Freq. 
0- 50 82 88.1 
50-100 75 66.8 
100-150 49 50.2 
150-200 45 39.3 
200-250 27 28.5 
250-300 17 21.3 
300-350 16 16.6 
350-400 13 12.3 
400-500 11 16.2 
500-600 10 9.4 
600-up 16 12.3 
Total 361 361.0 
x? =6.01 


X. Commercial Aircraft Radio Tube Failure 


Data 


Number of commercial aircraft radio tubes rejected in each 100-hour 
interval of life from operating records on 1,912 tubes [3]. Those rejected 
on installation (initial failure) and those still operating satisfactorily at 
the conclusion of the observation period (2,200 to 3,000 hours) are ex- 


cluded from the analysis. 


Analysis 


The observed frequency distribution was compared with that ex- 
pected from the exponential frequency distribution. 





140 AMERICAN STATISTICAL ASSOCIATION JOURNAL, JUNE 1952 
Result 








Time Interval Observed Expected 
(hr.) Freq. Freq. 





O— 100 79 66 
100— 200 59 55 
200— 300 39 47 
300— 400 29 39 
400— 500 27 33 
500— 600 24 28 
600— 700 25 23 
700— 800 23 19 
800— 900 21 16 
900-1000 13 14 

1000-1100 9 11. 
1100-1200 9 10 
1200-1300 10 
1300—1400 12 
1400-1500 6 
1500-1600 5 
1600-1800 10 
1800-2000 7 
2000-2900 11 


Downe OnNnNHONHDONH RAH OD 











Total 418 





d.f.=17 x? =15.81 P=0.53 


XI. Dry Cell Failure 
Data 
Lifetimes in minutes of 100 flashlight cells. 


Analysis 


The chi-square index was used to test divergence of observed fre- 
quencies from theoretical frequencies as determined from normal dis- 
tribution theory. The frequency distributions are set up in intervals of 
one and two standard deviations from the mean. 


Result 


Fig. 4 indicates that the distribution conforms to the normal. 
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Deviation from 
Mean Life in 
Standard Deviations 


Number Observed Expected 
Minutes Freq. Freq. 





Less than —1 Less than 706 13 16 
—1 tod 706-746 36 34 
0 tol 746-786 38 34 
More than 1 Over 786 13 16 














Total 100 100 





df.=1  x2=1.713 














FREQUENCY OF FAILURE 

















re) i 
500 600 700 800 900 
OPERATING TIME TO FAILURE (MINUTES) 





FiaureE 4. Frequency distribution of dry cell battery failure 
with operating time. 


XII. Incandescent Lamp Failure 
Data 


Lifetimes in hours of 417 forty-watt 110-volt internally frosted in- 
candescent lamps taken from 42 weekly quality control forced-life test 
samples. Close control of both the manufacturing processes and the 
conditions of test causes a normal theory of failure to be expected. 
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The observed frequency distribution was compared in three ways 
with frequency expected from a normal theory; A. by use of x? index 
of dispersion; B. by computation of the measure of skewness and kur- 
tosis; and C. Fig. 5, in which the observed histogram is compared with 
the normal curve. 
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Result 
A 








Deviations from Mean Life Observed Expected 
in Standard Deviations Freq. Freq. 





Less than —2 12 
—2to —1 49 
—-lto 0O 153 
Oto 1 147 
lite 32 45 
More than 2 11 





Total 417 











72 =5.3 P=0.16 
































FREQUENCY OF FAILURE 
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Fiaure 5. Frequency distribution of incandescent lamp failure 
with operating time. 
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B 


t test for skewness: 0.37 (no significant variation from symmetry) 
t test for kurtosis: 4.4 (significant deviation—the distribution is more 
peaked at the mean and flatter in the flanks than a true normal.) 
The above deviation from a normal distribution possibly arises from 
the questionable assumption of homogeneity of the data. The 417 fail- 
ures consisted of samples taken from 42 separate manufacturing runs, 
each of which probably had a slightly different normal distribution. 
The combination of several normal distributions all with the same 
mean but with different standard deviations, most of which were small 
but a few of which were large, would produce an overly peaked distribu- 
tion such as that found for the lamps. 


XIII. Bus Motor Failure 
Data 


Number of miles to first and succeeding major motor failures of 191 
buses operated by a large city bus company. Failure was either abrupt, 
in which some part broke and the motor would not run; or, by defini- 
tion, when the maximum power produced, as measured by a dyna- 
mometer, fell below a fixed percentage of the normal rated value. Fail- 
ures of motor accessories which could be easily replaced were not in- 
cluded in these data. 


Analysis 


A. First and second bus motor failures are compared by chi-square 
index with the normal theory distribution curve. 

B. Second and succeeding bus motor failures are compared by chi- 
square index with exponential distribution curve. 


Result 


The normal curve appears to approximate the distribution of the 
first bus motor failures, while the distributions of the third, fourth, and 
fifth bus motor failures appear to be approximated by the exponential 
distribution. 

The distribution of the second failures of the bus motors is not de- 
scribed by either theoretical curve but looks like a combination of both. 
(See Fig. 6.) 
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AN ANALYSIS OF SOME FAILURE DATA 


A. Comparison with normal theory 
INITIAL BUS MOTOR FAILURES 
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Distance Interval, 
Thousands of Miles 


Observed Number 
of Failures 


Expected Number 
of Failures 











Less than 20 ‘} 3.6 
20- 40 11 8.2 
40— 60 16 18.5 
60- 80 25 31.3 
80-100 34 39.9 
100-120 46 38.3 

120-140 33 27.6 
140-160 16 15.0 
160-180 } - 
180-up 2 2.5 

Total 191 191.0 








d.f.=5 


x? = 9.93 P=0.08 


SECOND BUS MOTOR FAILURES 








Distance Interval, 
Thousands of Miles 


Observed Number 
of Failures 


Expected Number 
of Failures* 











0O- 20 19 9.2 
20- 40 13 12.9 
40- 60 13 16.9 
60- 80 15 18.7 
80-100 15 17.2 
100-120 18 13.1 
120-160 i} e2 
160-up 4 3.0 

Total 104 104.0 











d.f.=4 


x? = 16.57 P <0.01 


* From truncated normal distribution. 


B. Comparison with exponential theory 


Expected frequency: Njf(t)=(N/T)e-/? where T is the sample esti- 


mate of mean distance to failure. 
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SECOND BUS MOTOR FAILURES 








Distance Interval, 
Thousands of Miles 


Observed Number 
of Failures 


Expected Number 
of Failures 





0—- 20 
20— 40 
40— 60 
60— 80 
80-100 

100-120 
120-160 
160—up 


19 
13 
13 
15 
15 
18 

7 

4 


— — et DD 
COO Or FOO 
NWON, FOO > 


— 








Total 





104 


— 
oO 
— 
i) 





d.f.=6 


x? = 39.53 P<0.01 


THIRD BUS MOTOR FAILURES 








Distance Interval, 
Thousands of Miles 


Observed Number 
of Failures 


Expected Number 
of Failures 





0- 20 
20— 40 
40— 60 
60— 80 
80-100 

100—up 


27 
16 
18 
13 
11 
16 


31.2 
21.6 
14.8 
10.3 

7.2 
15.9 





Total 





101 





101.0 





d.f.= 


x? = 4.42 P=0.35 


FOURTH BUS MOTOR FAILURES 








Distance Interval, 
Thousands of Miles 


Observed Number 
of Failures 


Expected Number 
of Failures 





0-20 
20-40 
40-60 
60-80 
80-up 


34 
20 
15 
15 
12 


36. 
22. 
13. 





Total 





96 








d.f.=3 


x? =5.34 P=0.16 











E 1952 
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FIFTH BUS MOTOR FAILURES 














Distance Interval, Observed Number Expected Number 
Thousands of Miles of Failures of Failures 
0-20 29 36.0 
20-40 27 20.7 
40-60 14 12.0 
60-80 8 6.9 
80—up 7 9.4 
Total 85 85.0 











d.f.=3 x? =3.40 P=0.32 


C. Discussion of results 


Consideration of the circumstances attendant upon the first and sub- 
sequent motor failures makes the peculiarities of the distribution curves 
(Fig. 6) appear reasonable. A normal distribution of mileage to first 
failure can be expected, for the moving parts of the motor slowly abrade 
each other until, when a sufficient amount of metal is worn away, they 
either break or no longer perform their function with satisfactory effi- 
ciency. This expectation is confirmed by first failure being caused 
singly or in combination by worn cylinders, pistons, piston rings, 
valves, camshafts, connecting rod or crankshaft bearings, etc. There 
were, however, a considerable number of failures in which moving parts 
fractured at low mileages, indicating manufacturing errors or inade- 
quate or improper maintenance or repair. These latter conditions were 
exponential type occurrences which should follow a different distribu- 
tion. Unfortunately these data on the cause of failure were so fragmen- 
tary and incomplete that a segregation of the two types of failure could 
not be accomplished. The indication of failure resulting from two dis- 
tinct causes however was too strong to be disregarded. Fig. 7 shows a 
normal and an exponential distribution curve which might be com- 
bined to describe the observed initial failure distribution of the bus 
motors. 

An exponential distribution of the third and subsequent failures ap- 
pears logical in that by the time the motor has been overhauled twice 
it consists of components in a scattered state of wear. The individual 
parts may each exhibit a normal distribution of failure with operating 
time, but with components in random stages of wear a motor has an 
equally likely chance of failing during any period of operation. This 
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does not mean that the chance of failure during equal operating periods 
is the same after each overhaul. The mean mileage to failure after the 
first overhaul is less than that of first failure, and after each subsequent 
overhaul, a shorter mean mileage to failure is exhibited. 
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Figure 7. Frequency distribution of first bus motor failure with miles traveled. 


The second failures are not predominantly due to either exponential 
or normal theory causes and can therefore be expected to exhibit a dis- 
tribution characteristic of neither cause. 
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CLASSIFICATION AND ANALYSIS OF PARTIALLY 
BALANCED INCOMPLETE BLOCK DESIGNS 
WITH TWO ASSOCIATE CLASSES 


R. C. Bosz anv T. SHimamMoto 
University of North Carolina 


I. INTRODUCTION 


NCOMPLETE block designs are now in fairly general use, especially the 

balanced incomplete block and the lattice designs. One of the au- 
thors in collaboration with Nair [1] introduced in 1939 a wider class of 
designs, viz. partially balanced incomplete block designs which in- 
cluded as a special case the balanced incomplete block designs and the 
square lattices. A slight generalization by Nair and Rao [2], resulted in 
the inclusion of cubic and other higher dimensional lattices as special 
cases. Recently other special cases of partially balanced designs are 
beginning to be used, for example, the rectangular lattices of Harsh- 
barger [3, 4, 5] and the linked block designs of Youden [6]. It is the 
object of this paper to review the more important properties of partially 
balanced designs with two associate classes and to give both the intra- 
block and the combined intra- and inter-block analysis in a simplified 
and compact form, together with an illustrative numerical example. A 
special feature is the division of these designs into a small number of 
distinct types, for each of which the association scheme can be ex- 
plicitly exhibited. This simplifies the numerical computations as well 
as the interpretation of the results. A number of useful designs belonging 
to each type have been given for illustrative purposes, but no complete 
tabulation has been attempted. Complete tables (up to ten replications) 
will be published by the Institute of Statistics, University of North 
Carolina. 


II. DEFINITION OF PARTIALLY BALANCED DESIGNS AND THE 
RELATIONS BETWEEN THE PARAMETERS 


1. An incomplete block design is said to be partially balanced if 
it satisfies the following conditions: 

(i) The experimental material is divided into b blocks of k units each, 
different treatments being applied to the units in the same block. 

(ii) There are v treatments each of which occurs in r blocks. 

(iii) There can be established a relation of association between any 
two treatments satisfying the following requirements: 

(a) Two treatments are either Ist, 2nd, - - - , or m-th associates. 
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(b) Each treatment has exactly n,, 7-th associates ({=1, 2, +--+, m). 

(c) Given any two treatments which are 7-th associates, the number 
of treatments common to the j-th associates of the first, and the k-th 
associates of the second is pj,‘ and is independent of the pair of treat- 
ments with which we start. Also pjx.‘=pi;'. 

(iv) Two treatments which are 7-th associates occur together in ex- 
actly A; blocks. 

The numbers », b, r, k, A1, A2, + * +, Am; M1, M2, * * * , Mm May be called 
the parameters of the first kind and the numbers p,‘ (i, j, k=1, 
2, - +--+, m) the parameters of the second kind, belonging to the design. 
Thus there are 2m+4 parameters of the first kind, and m?(m+1)/2 
parameters of the second kind (since pj.‘= p;;*). 

In the original formulation of the definition of partially balanced de- 
signs [1], \1, A2, - - *, Am Were considered to be different. The modifica- 
tion introduced in [2] consisted in allowing some of the \’s to be equal. 
The definition given here has been so rephrased as to bring out the fact 
that the association relations between the treatments of a partially 
balanced design are governed only by the requirements of paragraph 
(iii) of the definition, and do not depend on how the treatments are 
distributed in blocks. The association relations depend only on the 
parameters m, M2, +--+ -, Mand px *(t,j, k=1, 2, - ++, m), which are 
connected by the formulae (2.23), (2.24), and (2.25) given in the next 
paragraph. 

2. The following relations hold between the parameters: 


(2.20) (i) bk = ur 
(2.21) (ii) m+ m+:++ +m =v—1 
(2.22) (iti)  midy + meade + +++ + mdm = r(k — 1) 
(2.23) (iv) >> pei = 0; (if ¢ ¥ j) 

kel 
(2.24) =n;—1 (if t=) 
(2.25)  (v) Nip? = Np? = mpi. 


These relations have been proved in [1], where it is shown that if pa- 
rameters of the first kind satisfying (2.20), (2.21), and (2.22) are as- 
signed, then in virtue of the relations (2.23), (2.24), and (2.25), only 
m(m?—1)/6 of the parameters of the 2nd kind are independent. 

3. Partially balanced designs with only two associate classes (m= 2) 
are of special interest. These designs depend on 8 parameters of the Ist 
kind v, b, r, k, 1, Ae, M1, M2 connected by the three relations (2.20), 
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(2.21), and (2.22), and 6 parameters of the 2nd kind px ‘(t, 7, k=1, 2) 
connected by the relations (2.23), (2.24), and (2.25). The parameters 
of the 2nd kind may be exhibited as elements of two symmetric ma- 
trices 


1 1 2 
(2.30) Pi = ~ Pix? = al _ 
Pa’ Pan Pa” Pre 

The relation of association between the treatments, referred to in 
paragraph (iii) of the definition of partially balanced designs can be 
effected by giving a rule, which tells us what kind of associates two 
given treatments are. In particular when m=2, the rule will specify 
whether two treatments are Ist associates or 2nd associates. The rule 
however must be such that the conditions (b), (c) are satisfied. 

As an illustration let v=10, m=2. Arrange the ten treatments a, b, 
c, d, e, f, g, h, 7, j in the scheme 


*abed 
e*erye 
(2.31) be* kh 
cf h * j 
qgij * 


Suppose that the rule of association is that two treatments are Ist 
associates if and only if they occur together in the same column of the 
scheme, otherwise they are 2nd associates. 

It can now be verified that each treatment has 6 Ist associates, and 
3 2nd associates. Thus condition (b) is satisfied with n,=6, ne=3. The 
first and second associates of a, b and 7 are given below 


Treatment Ist associates 2nd associates 


a b, C, d, é, A g t, j; h 
(2.32) b a, Cc, d, é, h, t I, 9; j 
a b, é, h, d, 9; j a, C, f 


The treatments a and 6 are first associates. The number of treat- 
ments which are 1st associates of a as well as b is 3 since the only such 
treatments are c, d and e. Thus pu!=3. It can be verified that starting 
with any two treatments which are Ist associates we get exactly 3 
treatments which are Ist associates of both. This verifies the condition 
(c), so far as the constancy of pu! is concerned. Again f and g are the 
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only treatments common to the Ist associates of a and 2nd associates 
of b. Starting with a and b we thus get p:2:'=2, and we can then verify 
the constancy of this number, whichever pair of 1st associates we start 
with. The same holds for other py’s. The parameters py‘ when ex- 
hibited in the form (2.30) are 


om wa wed 


Remembering that n,=6, ne=3 and m=2, we can now verify the 
relations (2.23), (2.24), (2.25). 

As the scheme (2.31) pictorially summarizes the association relations 
between the treatments we may call it an “association scheme.” To get 
a design based on this association scheme we now have to arrange the 
10 treatments into blocks, satisfying the requirements (i), (ii) and (iv) 
of the definition. These blocks then give the plan of the design. It may 
happen that there is more than one design based on the same associa- 
tion scheme. For example there are four different designs based on the 
association scheme discussed here. These are the designs number 1, 2, 
3 and 4 of Table II A and B. Table II A gives the parameters », b, r, k, 
Ai, Ae, 2, C1, C2, A, H, where n is the side of the square in the association 
scheme, and ¢, cz, A and H are certain constants useful for analysis of 
the results of the design. Their use is explained in detail later in sections 
VI and VII. Table II B gives the association scheme and the plans 
(the blocks are given by the columns of the plan). As an illustration 
consider the plan for the design no. 4. Since b=10, k=4, r=4, there 
are 10 blocks each containing 4 treatments, and each treatment occur- 
ring in 4 blocks. Since \, = 1 any two treatments which are Ist associates 
occur together in one block, e.g. the treatments a ard b occur together 
in the 2nd block. Since \,=2 any two treatments which are 2nd asso- 
ciates occur together in two blocks, e.g. the treatments a and 7 occur 
together in the 5th and 9th blocks. 

Partially balanced designs for which the association between treat- 
ments is of the type discussed above are said to belong to the triangular 
type. A review of all the known partially balanced designs with only 
two associate classes, recently undertaken by the authors revealed the 
interesting fact that designs of this kind can be grouped into a number 
of distinct types, according to the nature of the corresponding “associa- 
tion scheme.” The following types of association schemes have to be 
considered : 

(1) Group divisible (GD) 
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(2) Triangular (T) including the subtypes triangular doubly linked 
blocks (TDLB) and triangular singly linked blocks (TSLB) 

(3) Singly linked blocks (SLB) 

(4) Latin square with 7 constraints (L,) 

(5) Cyclic (C). 


III. THE GROUP DIVISIBLE TYPE 


1. The simplest type of partially balanced designs with two associ- 
ates classes is the group divisible shortly denoted by GD. In this case 
v=mn. The treatments can be divided into m groups of n treatments 
each, such that any two treatments in the same group are Ist associates 
and two treatments in different groups are 2nd associates. The associa- 
tion scheme can be exhibited by placing the treatments in an nXm 
rectangle, where the columns form the groups. Clearly 


(3.10) m=n-—1, nm, = n(m — 1). 
Hence, 
(3.11) (n — 1). + n(m — 1)d2 = r(k — 1), 


and the parameters of the second kind are uniquely determined by m 
and n. In fact 


(3.12) a 0 = ( 0 ae 
™™ 0 ak ol —-_ n—1 n(m—2)/° 


A detailed study of the combinatorial structure and methods of con- 
structing GD designs will appear elsewhere [7]. We shall, however, give 
some properties and illustrations of the main subdivisions into which 
these designs fall. 

2. For every GD design the following inequalities can be shown to 
hold 


(3.20) r=, rk — vdeo = 0. 


A GD design is said to be singular if r=). A singular GD design is 
always derivable from a corresponding balanced incomplete block de- 
sign on replacing each treatment by a group of n treatments. These 
groups give the groups of the association scheme. Thus consider the 
balanced incomplete block design with parameters o* =b*=7, r*=k* 
=3, A*=1. The plan for this is given below, the columns representing 
the blocks. 
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abcedefg 

(3.21) bedefga 
defgaobe 


If we take n=2, then we may replace the treatment a by ai, a2 and 
do the same for the other treatments. We then get the singular GD 
design with the parameters 


v= 14,b=7,r=3, k=6, uy =3, y=1, m=7, n = 2, 
the plan for which is shown below: 
Ha ada if n 
Gz be Ce dz ee fe gs 
hada fi gu 
be C2 dz @2 fe ge a& 
hah nah a 
dz @2 fe g2 G2 be & 


The association scheme is 


(3.22) 


(3.23) mHha dh a hi 91 
G2 be Ce de ee fe ge 


In general corresponding to the balanced incomplete block design 
with parameters v*, b*, r*, k*, \*, we shall have a singular GD design 
with parameters 


(3.24) v = no*,b = b*,r = r*, k = nk*, yy = 1, Xe = N*, m = 0*, TN. 
A useful class of singular GD designs is obtained by starting with the 

unreduced balanced incomplete block design with parameters, 

(3.25) v* = ¢, b* = tt — 1)/2, r* =t —1, * = 2, * = 1, 

obtained by taking for blocks all possible pairs out of ¢ treatments, and 


then replacing each treatment by n new treatments (n=2, 3, 4, or 5). 
The resulting singular GD design has the parameters 


v=nt, b= t(t —1)/2, r=t—1, k = 2n, 


(3.26) 
u =t-—1, w= 1, m=t, n. 


3. A GD design is non-singular if r~) . Non-singular GD designs 
can be further subdivided into two classes. 








952 
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A GD design is said to be semi-regular if r>x, rk—vdA2=0. For a 
semi-regular GD design there holds the inequality 


(3.30) b2v—m+1. 


Also each block must contain the same number of treatments from each 
group so that k must be divisible by m. 

A GD design is said to be regular if r>),, rk—vd2>0. For a regular 
GD design there holds the inequality 


(3.31) bz». 


In Table I A we give the parameters of some useful non-singular GD 
designs with r<5. The corresponding association schemes and plans 
are given in Table I B. In the association scheme the columns give the 
groups. In the plans the sets of treatments to be assigned to the same 
block are given by the columns. As already mentioned the constants 
¢1, C2, A, H are useful for analysis and their significance will be explained 
later. 


TABLE IA 
PARAMETERS OF SOME NON-SINGULAR GD TYPE DESIGNS 

















Reference v b r k nv rs m n C1 C2 H A 
No. 
1 10 8 4 5 0 2 5 2 0 1/2 36/5 64/5 
2 12 9 3 4 0 1 4 3 0 1/3 21/4 27/4 
3 12 16 4 3 0 1 3 4 0 1/4 20/3 32/3 
4 12 12 4 4 2 1 6 2 4/7 2/7 13/2 21/2 
5 14 8 4 7 0 2 7 2 0 1/2 52/7 96/7 
6 14 14 4 4 0 1 7 2 0 2/7 13/2 21/2 
7 15 15 4 4 0 1 5 3 0 4/15 27/4 45/4 
8 20 16 4 5 0 1 5 4 0 1/4 36/5 64/5 
9 24 24 5 5 0 1 6 4 0 5/24 44/5 96/5 





IV. THE TRIANGULAR AND LINKED BLOCK TYPES 


1. Another important type of partially balanced design is the tri- 
angular type denoted shortly by 7’. In this case v=n(n—1)/2. We take 
an nXn square, and fill the n(n—1)/2 positions above the leading di- 
agonal by the different treatments a, b, c, etc., taken in order. The 
positions in the leading diagonal are left blank, while the positions be- 
low this diagonal are filled so that the scheme is symmetrical with re- 
spect to the diagonal. Two treatments in the same column are Ist as- 
sociates, whereas two treatments which do not occur in the same 
column are 2nd associates. The association scheme for the case n=5 is 
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TABLE I B 
ASSOCIATION SCHEMES AND PLANS FOR SOME NON-SINGULAR GD TYPE DESIGNS 








Reference Association 


No. scheme Plan 





1 acegi abab abab 
bdfhj cdecd dcde 
effe effe 
ghho hagh 
tijgs tijj 
Rep. 1, 2 | Rep. 3, 4 
abcd ahefkjlig 
efgh hebkjlicd 
ijkl bfklicgda 
cgdahebsj 
abe alik aefd 
def bhijd lajb 
ghi ecgef kchi 
jkl Rep. 1 Rep. 2 Rep. 3 Rep. 4 





abcdef abcdefghijkl Rep. 1 
ghijgkl bedefghijkla Rep. 2 
efghijklabed Rep. 3 
ghijklabcdef Rep. 4 


acegikm abababab 
bdfhjln edecddcde 
effeeffe 
ghhoghggh 
tigjgiijj 
ReELLIiLEE 


mmmmnnnn 


abcdefsg bedefgaijzgklmnh 
hijkimn ecdefgabjkimnhi 
efgabcdlimnhijk 
hijklmnabcdefg 





abcdefghijklmno 
cdefghijklmnoab 
defghijklmnoabe 
Imnoabcdefghijk 





abchefjkmrinlast 
hefijikmdrgnqopsab 
efjkmdrgilopasatbe 
drginlqoptabchjk 
ginlqagopstachefmd 





abcdefghijgklimnopaqrstuvwez 
cdefghijgklmnopaqrstuvwrab 
parstuvwrabcdefghijzklmno 
aqrstuvwrabcdefghijklimnop 
tuvwrabcdefghijklmnopaqrs 
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given by (2.31), and this particular case has already been discussed in 
a some detail in section II. In the general case we clearly have 


(4.10) nm = 2n — 4, m = (n — 2)(n — 3)/2. 


— n—2 n-—-3 
a -(-o anche 


4 2n — 8 
oes \. —8 (n—4)(n— ol 


2. In the original paper by Bose and Nair [1] introducing partially 
balanced designs it was pointed out (p. 370-71) that a number of par- 
tially balanced designs can be obtained by dualizing the known bal- 
anced incomplete block designs (i.e., by taking blocks as treatments 
and treatments as blocks), and a number of examples were given. 
Recently Youden [6] has used designs which he calls linked blocks, in 
which two blocks always have the same number of treatments in com- 
mon. For “singly linked blocks” two blocks have one treatment in com- 
mon and for “doubly linked blocks” two blocks have two treatments in 
common. it is easy to see that the dual of any balanced incomplete 
block design in which any pair of treatments occur together \* times, 
is such that any two blocks of the dual Lave \* treatments in common. 
But the dual of a balanced incomplete block design may not always be 
partially balanced, though all the designs given by Youden happen to 
be partially balanced. It has been pointed out by Shrikhande [8] that 
the dual of a balanced incomplete block design with parameters v*, b*, 
r*, k*, \* will always be partially balanced in the following cases: 


(4.20) (a) * = 1, 
(4.21) (b) r= k, k* =k —2, * =2, 


(4.11) 


The parameters for the corresponding partially balanced designs for 
the case (a) are 


(4.220) v=k(rk-—k+1)/r,b=rk-—-k+1,r=r,k=k, 
(4.225) m =r(k—1), m= (k—r)(r—1)(k—-1)/7,mM=1,me=0 





(4.230) —_— (r — 1)(k — 7) ) 
-230) Dix (r — 1)(k — r) (r — 1)(k — \(k —¢ — 1)/r/]’ 
“is r(k —r — 1) 
aa “( k(k — D) 
r(k —r —1) (k — + 2" — 1) - 
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The parameters for the corresponding partially balanced design for 
the case (b) are: 


v = k(k — 1)/2, b = (k — 1)(k — 2)/2, r =k — 2, 
k=k, wy =1, + = 2; 
(4.25) nm, = 2k — 4, ne = (k — 2)(k — 3)/2; 
4.200) pt=( 8 ag aaa) 
k—-3 (k—3)(k — 4)/2 
4 2%x%-8§ 
side sthsadiiineslial 


(4.24) 


(4.265) pat = ( 


It is readily seen that the case (b) comes under the triangular type 
already discussed. This particular subtype of the triangular type we 
may call TDLG (triangular doubly linked blocks). 

The designs under the case (a) with parameters given by (4.220) to 
(4.235) may be said to belong to the type singly linked blocks, shortly 
SLB. The association scheme for these designs is given by the designs 
themselves, since any two treatments are first associates when and 
only when they occur in the same block. There is however a certain 
overlap between this type and the triangular type. In fact if k* =2, 
\*=1, then we have an unreduced balanced incomplete block design 
in the sense of Fisher and Yates [9], with parameters 


(4.27) v* =n, b* = n(n — 1)/2, r* =n —1, * = 2, * = 1, 
and the dual of this is the SLB type design with parameters 
(4.28) v=n(n—1)/2,b=n,r=2,k=n-—1, ry =1, » =), 


with m, m2 and pj. ’s being given by (4.10), (4.11). 

These designs are easily seen to come under the triangular type also. 
In fact, they are obtained by taking for blocks the columns of the 
triangular association scheme. We shall call these designs TSLB (tri- 
angular singly linked blocks) but in order to retain uniqueness of classi- 
fication count them under the triangular type only. 

In Table II A we give the parameters of some useful triangular de- 
signs including TSLB (Ref. nos. 1, 5, 8, 9, 10, 11, 12) and TDLB (Ref. 
nos. 3, 7). The corresponding association schemes and plans are given 
in Table II B. As before the sets of treatments to be assigned to the 
same block are given by the columns. 

In Table III A we give the parameters of some useful designs of the 
SLB type. The corresponding plans are given in Table III B. The plans 
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themselves provide the association scheme since any two treatments 
are first associates when and only when they occur in the same block. 
The parameters px‘ are given by (4.230) and (4.235). 


TABLE IIA 
PARAMETERS OF SOME TRIANGULAR TYPE DESIGNS 








Reference 
No. 


i 
> 
* 
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TABLE IIIA 
PARAMETERS OF SOME SLB TYPE DESIGNS 








Reference 
No. H 4 





31/6 13/2 
36/7 45/7 
57/8 25/2 
46/9 19/3 
64/9 112/9 
51/10 63/10 
91/10 41/2 


ooo oanoa 
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TABLE II B 
ASSOCIATION SCHEMES AND PLANS FOR SOME TRIANGULAR TYPE DESIGNS 








Reference 
No. 


Association 
scheme 


Plan 





1 


*abcd 
a*efg 
be*hi 
cfh*j 
dgij* 


aehjd 


bficg Rep.l 


egbfi 
daehj Rep.2 





Same as for 1 


ahbgiecjdf 
bihedfjgac 
ejcibhdfga 





aahdaj 
hhdffg 
idfiie 
gigejb 
cebcbe 





bjgfaehcid 
jacbhidgefh 
Sbhigedgae 
gebdhjaicf 





*abcde 
a*fghi 
bsf*ijkl 
cgj*mn 
dhkm*o 
eilno* 


afjmoe 
bgkndi 
chlehl 
dibgkn 
eafjmo 





Same as for 5 


aklbnmohcfdiegj 
jamobiclkngdfeh 
onaghbfcidljmke 





Same as for 5 


hdofblgcecna 
dohblfenge 
Jjimemakki 
lfbohdohdj 
ngcengclfbm 
aejamikeik 





*abcdef 
a*ghijk 
bg* lmno 
chl*paqr 
dimp*st 
ejnqs*u 
Skortu*® 


aglpsuf 
bhmqtek  Rep.1 


cinrdjo 


djocinr 
ekbhmqt  Rep.2 
faglpsu 





9, 10, 11, 12 





Similar to 1, 5, 8 





Blocks are obtained as in 1, 5, 8 by taking the 
columns of the association scheme, starting with the 
treatment just below the vacant place indicated by 
a star. When k is even the first k/2 positions in a 
block give a complete replication, and the same holds 
for the last k/2 positions, as in 1 and 8. 








1 


TABLE III B 
ASSOCIATION SCHEMES AND PLANS FOR SOME SLB TYPE DESIGNS 





28, b* =63, r* =9, k* =4, »* 
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By dualizing the corresponding balanced incomplete block design with parameters 
By dualizing the corresponding balanced incomplete block design with parameters 


v* =21, b* =70, r* =10, k* =3, A*=1 
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V. THE LATIN SQUARE AND CYCLIC TYPES 


1. The next class of partially balanced design to be considered is the 
Latin square type with 7 constraints. This type may be denoted by L,. 
The number of treatments is a perfect square say v =n”. The treatments 
may be set forth in a square scheme. For the case i= 2, two treatments 
are lst associates if they occur in the same row or column, and 2nd 
associates otherwise. For the general case we take a set of i—2 mutually 
orthogonal Latin squares (if such a set exists). Then two treatments 
are first associates if they occur in the same row or column, or corre- 
spond to the same letter of one of the Latin squares. Otherwise they 
are second associates. For example, for the design no. 3, in the Table 
IV B, the 25 treatments a to y are arranged in a square, with a super- 
imposed Latin square with the letters A, B, C, D, E. The first associates 
of hare f, g, 7, j, c, m, r, w, b, n, t, u. The ordinary lattice designs belong 
to the Latin square type but do not exhaust it. Clearly 


(5.10) m = i(n — 1), m = (n—1)\(n—7t+1), 


(5.110) Dix? = ( 


v?—3i+n i= ei 
G—D(n-—i4+1) n-dD(n-it41)/’ 


a(t — 1) i(n — 1) ) 


(5.115) pp? = 9 (n—a?+i-2 


In Table IV A, we give the parameters of some useful designs of the 
type L, which are not lattice designs. The corresponding association 
schemes and plans are given in the Table IV B. 


TABLE IV A 
PARAMETERS OF SOME LATIN SQUARE TYPE DESIGNS 








—— t Ca Cs H 





1 2 4/15 8/15 27/4 45/4 
2 0 3 |-3/8 3/16 4 32/9 
3 3 3/25 —3/25 25/3 50/3 
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TABLE IV B 
ASSOCIATION SCHEMES AND PLANS FOR SOME LATIN SQUARE TYPE DESIGNS 








Reference Association 


No. scheme Plan 





1 afhibdege Rep. 1 
fhabdigee Rep. 2 
tbdegeafh Rep.3 
bdigcecefha  Rep.4 





jklimnopghefbcda_ Rep.1 
ghefdabcijklpmno_ Rep.2 
pmnofghedabcijkl Rep.3 





abcdeoklmnzyuvwhijfoqrstp 

eabcdnoklmwryuvghijgfparst 

yuvwrigfghrstpqbcdeak lmno 
Blocks 1-25, Reps. 1, 2, 3 


abcdeoklmnzyuvwhijfgqrstp 

cdeablmnokuvwryjfghiétpar 

mnokluvuwryufghijtparsdeabe 
Block 26-50, Reps. 4, 5, 6 











2. Another important type of partially balanced design is the cyclic 
type shortly denoted by C. For this case instead of taking the treat- 
ments to be a, b, c, ---, it is more convenient to take them as the 
integers 1, 2, ---, v. The first associates of the treatment 7 are 


(5.20) t+dii+ds-:+,t+dn, (mod v), 


where the d’s satisfy the conditions: 

(i) the d’s are all different, and 0<d;<v(j=1, 2, --+,m); 

(ii) among the ,(m,—1) differences d;—d;, (j, j’=1, 2, +--+, m 
jj’) reduced (mod v), each of the numbers d;, dz, ---, dn occurs 
a times, whereas each of the numbers @1, €2, - + -, én, occurs B times, 
where dj, de, - + + , Qn, €1, €2, * * * , €n, are all the different v—1 numbers 
1,2, --+,v—1.[To reduce an integer mod v, we have to subtract from 
it a suitable multiple of v, so that the reduced integer lies between 1 and 
v. For example 17 when reduced mod 13 is 4.] Clearly it is necessary 
that 


(5.21) ma + mB = n(n — 1). 
The parameters p;‘ are given by 


m—-a-—l 
(5.220) pat = ( ¥ ait ), 
m—-a-—-lnm—-mt+atl 


(5 ° 225) Dix? = ( 
m1 


B m—B ) 
—8 m—m+p—1/) 
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The relations (2.23)—(2.24) are obviously satisfied. 

In the Table V A we give the parameters of some useful cyclic type 
designs. In the Table V B we give the corresponding association scheme 
and the plan. The association scheme is specified by giving the set of d’s, 
and the plan is specified by giving the initial block. All the blocks are 
obtained from the initial block by adding j7—1 to every treatment 
(j=1, 2, - +--+, v) and reducing (mod v). For example for the design 
number 1 of Table V A, v=13, and Table V B shows that 


d, = 2, dz = 5, ds = 6, dg = 7, ds = 8, & = 11. 


It is readily seen that among the differences d;—d, reduced (mod 13) 
each d occurs twice, whereas every other non-zero integer less than 13 
occurs thrice. The associates for any treatment are easily determined. 
Thus the first associates of 4 are the treatments 6, 9, 10, 11, 12, and 2. 
The initial block in the plan is given as 1, 3, 9. Hence the completed 
design is given by 


123 4 5 6 7 8 9 10 11 12 18 Rep. 1 
3 4 5 6 7 8 9 10 11 12 138 1 2 Rep.2 
9 10 11 122 13 1 2 3 4 5 6 FT 8 Rep.3 


the blocks being represented by the columns. 


TABLE V A 
PARAMETERS OF SOME CYCLIC TYPE DESIGNS 
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TABLE V B 
ASSOCIATION SCHEMES AND PLANS FOR SOME CYCLIC TYPE DESIGNS 














Reference No. Association scheme (set of d’s) Plan Initial blocks 
1 2, 5, 6, 7, 8, 11 (1, 3, 9 
2 Same as for 1 (1, 2, 5, 13), (1, 2, 5, 11) 
3 Same as for 1 (1, 2, 13), (1, 4, 11), (1, 5, 10) 
4 3, 5, 6, 7, 10, 11, 12, 14 (1, 3, 10, 12), (1, 2, 5, 6) 
5 Same as for 4 (1, 9, 13, 15, 16, 8, 4, 2) 
6 1, 4,5, 6, 7, 9, 13, 16, 20, 22, 23, 24, 25, 28 (1, 16, 24, 7, 25, 23, 20) 
7 Same as for 6 (1, 2, 8, 17, 21, 24, 25, 26) 








VI. ANALYSIS OF PARTIALLY BALANCED INCOMPLETE BLOCK 
DESIGNS WITH TWO ASSOCIATE CLASSES 


1. Let there be v treatments whose effects it is required to compare 
by using a partially balanced incomplete block design with two associ- 
ate classes. We may divide the N experimental units (to which the 
treatments are to be applied) into more or less homogeneous groups 
called blocks, each block containing k units. If the number of blocks is 
b, then 


(6.10) bk = N. 


The design divides the treatments into b sets, such that each set con- 
tains k treatments, and each treatment occurs in r sets. Then 


(6.105) vr = N. 


We shall first consider the case of “ungrouped blocks” i.e. the case 
when the sets of treatments are assigned at random to the b blocks, 
and the k units of a particular set are assigned at random to the units 
of the corresponding block. The designs considered in this paper are 
useful in experimental situations in which there is no interaction be- 
tween blocks and treatments, i.e. the blocks do not act differentially 
with respect to the treatments. Let y;; be the observed result of apply- 
ing the ith treatment to the jth block. We may then write 


(6.11) yge ott +b; 4+ a; 


so that the observation is considered to be the sum of (i) a general com- 
ponent g (ii) a component ¢t;, the effect of the ith treatment (iii) a 
component b;, the effect of the jth block (iv) a random component ¢;; 
with expectation zero, the ¢;;’s being independently distributed. 

It should be noted that to a given j (j=1, 2, - - - , b) only k different 
values of 7 are associated, since the symbol y;,; or t;; is meaningful only 
when the ith treatment occurs in the jth block. 
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A linear function of the observations 


(6.115) Y = Do ais 


is defined to be an intra-block contrast if the sum of the coefficients of 
the k observations coming from the jth block vanishes for j=1, 
2, ---, b. Substituting from (6.11) it is clear that the value of Y is 
independent of g and the b’s. Hence E(Y) is a function of the treatment 
effects only. Given any treatment contrast 


k 
(6. 12) N=htt+het+---+lt, >&k=0 


t=] 


there will in general exist an infinite number of intra-block contrasts Y 
for which 


(6.125) E(Y) = 1 


and which thus provide an unbiased estimate of II. 

If the residuals ¢;; are homoscedastic, i.e. have a common variance 
o’, then it follows from a general theorem applicable to all incomplete 
block designs (cf. [10], [11], [12]), that the best intra-block estimate 
(i.e. the intra-block estimate with the least variance) of II, is obtained 
by substituting in II the solution of the normal equations 


(6.18) Cit + Cite + +++ + Cite = Q: (¢ = 1,2,---,») 
where 
(6.14) a r(1 - -), 
k 
(6.145) Ciu = — Neu/k tA#~u 


and Aju=A1 or Az according as the ith and uth treatments are 1st or 2nd 
associates. Q; is the adjusted treatment total for the ith treatment, 
which is defined as follows: Let 7; be the total for the 7th treatment, 
i.e. 7’; is the sum of the observations from the r experimental units to 
which the ith treatment has been applied, and B; is the total for the 
jth block, i.e. B;is the sum of the observations from the k experimental 
units in the jth block; then the adjusted treatment total Q; for the ith 
treatment is obtained by subtracting from 7';, 1/k times the sum of the 
totals of the r blocks in which the ith treatment appears. 

The solution of the normal equations (6.13) is unique up to an arbi- 
trary constant. Thus if 4, f, ---, 2 is one solution, then the general 
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solution is f+, +a, - - -, 2,4+a, where a is arbitrary. This arbitrari- 
ness however does not affect the estimation of any treatment contrast, 
since the result of substituting the general solution in (6.12) is 


(6.15) Li, + lelat+ +++ + lel, 


and is independent of a. 

It should be noted that (6.15) is an unbiased estimate of [t:+lete 
+.--+-+-+l,t, even if the assumption of the homoscedasticity of the 
residuals ¢;; is not satisfied, though in this case it will in general not be 
the intra-block estimate with the least variance. It will still be a fairly 
good estimate for all practical purposes, if the variances of ¢;;’s are 
approximately equal. The homoscedasticity of residuals is however as- 
sumed in what follows. 

To render the solution of the equations (6.13) unique it is usual to 
measure the treatment effects from their mean, which is equivalent to 
imposing the restraint 


(6.155) htbkht+---+t=0. 


The solution of the normal equations (6.13) available in current 
literature is somewhat cumbrous. But we can write the solution in the 
neat form 


(6.16) r(k — 1)é; = kQ; + oSi(Q;) + @S2(Q:), t= 1,2,---,0 


with the help of the auxiliary constants ¢:, cz, defined by (6.175) and 
(6.18), when A is given by (6.165). It is also convenient to define here 
the constant H for later use. 


(6.165) k*A = (rk —r+1)(rk — r+ ro) 

+ (dr — de) {r(& — 1)(pist — ps?) + Aepre! — pn’, 
(6.17) KPH = (2rk — 2r + dr + As) + (pis? — piz?)(Ar — Az), 
(6.175) kde, = Au(rk — r + Xo) + (Ar — As) (Aopia! — Are’), 
(6.18) kc = Ao(r& — r + Xa) + (Ar — Aa) (Acpia? — Arpn?). 


Here S,(Q;) is the sum of the adjusted treatment totals for all the Ist 
associates of the 7th treatment, and likewise S.(Q;) is the sum of the 
adjusted treatment totals for all the 2nd associates of the ith treat- 
ment.,Using the identity 


(6.185) Q: + Si(Qi) + S2(Q;:) = 0, 
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the equation (6.16) can be written in the alternative form 


k—t 
(6.19) i; = woe ey 8100. 


The variance of the intra-block estimate of the difference of two 
treatment effects is given by the formula 
20? k — Cj 


(6.195) V (t; — i.) = a 





where j= 1 or 2, according as the ith and uth treatments are Ist associ- 
ates or 2nd associates. 

Denoting by G the grand total of all the observations, the analysis 
of variance is shown in Table VI A below: 




















TABLE VI A 
INTRA-BLOCK ANALYSIS OF VARIANCE (WITH UNGROUPED BLOCKS) 
Source of variation Degrees of Sum of squares Mean square 
freedom 
J 
Treatments (adjusted) v-1 St= 2 iQ; 8:2 =S;,2/(v —1) 
i=1 
Wes Gq 
Blocks (unadjusted) b-1 St=— 2 Bi-— 
k i=! N 
Error N-—b-—v+1 S,? (by subtraction) 8,2 = S,2/(N —b —v+1) 
Gq? 
Total N-1 2 vijQ-— F =8;,2/s¢2 
oj N 














With the help of this table the F-test for significant differentiation 
among the treatments as a whole, or the t-test for the significance of the 
difference between any two particular treatments can be carried out in 
the usual manner, if the residuals e;; are normally distributed. 

2. Since the sets of treatments are assigned to the blocks at random, 
it is plausible to regard the block effect b; in (6.11) as a random variate 
with expectation zero and a variance say o;’. In this case any contrast 


b 
(6.20) Y’=a@B,+a@B,+---+a@B, > a;=0 
j=l 


between the block totals estimates a linear function of the treatment 
effects. Yates has shown in a series of papers, [13], [14], [15], that by 
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using the information contained in the contrasts between the block 
totals, we can get estimates of greater accuracy than can be got by 
using intra-block estimates only. The general formulae for the recovery 
of inter-block information were given by Nair in [16] and by Rao in [12]. 
By using the auxiliary constants ci, cz, A, and H defined by (6.165), 
(6.17), (6.175), and (6.18), we shall express the best estimates using 
both intra- and inter-block information in a neat form. 

The excess (positive or negative) of the average of the observations 
from the jth block over the general average is 


(6.205) By 00 en ares 


We define Q,’ to be the sum of the excesses E; summed up over the 
blocks in which the ith treatment occurs. Then 


rG 
6.21 /=T7T;- os as 
(6.21) Q Q W 
Let us set 
1 1 
(6.22) w=>—) wv! = ———— 
o? o? + ke;? 





(6.225) P,=wQt+w'Q/, R= re(w -< -) 





1 Niu 
(6.23) Cc’ = R.(1 ~ -). Cy,’ = — : (w — w’). 


Rao [12] has shown that the new normal equations for obtaining the 
best combined intra- and inter-block estimates are 


(6.24) Ca’ty + Cin’ te + 2 + Civ tvs = P; (t = 1, 2, Poe iv); 


so that the best estimate of the treatment contrast II defined by (6.12) 
is 


(6.25) Li,’ + lef’ +--+ + Li’, 


where i,’, io’, - - - , t,’ isa solution of (6.24). As before the solution may 
be rendered unique by imposing the restraint (6.155). 

The solution of the normal equations (6.24), using combined intra- 
and inter-block information, can be written in the compact form 


(6.26) r{w’ + w(k — 1)}i,’ = kPs + diSi(Pi) + deS2(Pi), 
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where 

c;A(w — w’)? + rdjw’'(w — w’) 
A(w — w’)? + rHw'(w — w’) + rw” 
and S,(P;) is the sum of the P’s for all the Ist associates of the ith 


treatment and S2(P;) is the sum of the P’s for all the 2nd associates of 
the ith treatment. On using the identity 





(6.265) — j=1,2 


(6.27) P; + Si(Pi) + 82(P:) = 0, 
the equation (6.26) can be written in the alternative form 
k— d, — d 
(6.28) 7,’ : mae Si(P)). 








- r{w’ + w(k — 1)} ii r{w’ + w(k — 1)} 


The variance of the estimate of the difference of two treatment ef- 
fects, using both intra-block and inter-block information is given by 


2(k — dj) 
r{w’ + w(k — 1)} 
where j= 1 or 2, according as the 7th and uth treatments are Ist associ- 
ates or 2nd associates. 


The quantities w and w’ which enter into the coefficients of the equa- 
tions (6.26) are not known and have to be estimated from the observa- 





(6.29) Vii,’ — i’) = 


TABLE VI B 


AUXILIARY TABLE FOR INTER-BLOCK ANALYSIS OF VARIANCE 
(WITH UNGROUPED BLOCKS) 




















Source of variation Degrees of Sum of squares Expectation of the sum 
freedom - of squares 
° oT G? 
Treatments (unadjusted) v-1 8/2= > —-— 
im1 Ti N 
Blocks (adjusted) b-1 Sp? (by subtraction) (b—1)02+(N —v)o:2 
Error N-—b-—v+1 S,? (by transference) (N —b —v+1)o? 
Gg? 
Total N-1 z yijt— = 














tions themselves. This can be done by forming the auxiliary Table 
VI B. Certain items have to be transferred from the main Table VI A. 
o and o;” are then estimated by equating S,? and S,? to their ex- 
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pectations, and w and w’ are derived by substituting these estimates in 
(6.22). 

3. It may be possible in certain cases to group the sets of treatments 
to be applied to the blocks in such a way, that within each group every 
treatment is replicated n times. These groups may be called replication 
groups. Thus if there are r’ groups, then r=nr’. For example in the 
design no. 1. of Table I A and B, r=4, n=2, r’=2, and in the design 
no. 3 of the same tables, r=4, n=1, r’=4. The experimental material 
in this case should be so subdivided that the blocks to which the treat- 
ment sets of the same replication group are assigned, are relatively 
more alike. Thus in agronomic experiments, the land may be divided 
into r’ large pieces corresponding to the replication groups, and then 
each piece subdivided into b’=b/r’ blocks. In actually laying down the 
experiment one has now to make a threefold randomization in the as- 
signment of (i) the replication groups to groups of blocks, (ii) the sets 
of treatments in a replication group to the individual blocks in the 
block groups, (iii) the treatments of a set to the experimental units in 
the block. Suppose that the jth block now occurs in the cth group, and 
yi; is the observed result of applying the ith treatments to a unit in 
the jth block (provided that the 7th treatment occurs in the set of 
treatments which has been assigned to the jth block). The appropriate 
model in this case (in the absence of interaction between blocks and 
treatments) is 


(6.30) Yg = Qtt t+ pe tryst ey, 


so that the observation is considered to be the sum of (i) a general com- 
ponent g, (ii) a component ¢;, the effect of the ith treatment, (iii) a 
component p,, the effect of the cth replication group, (iv) a component 
x;, the differential effect of the jth block, within its replication group 
(v) a random component e;; with expectation zero, the €;;’s being inde- 
pendently distributed. 

So far as intra-block estimates are concerned there is no change from 
the case of ungrouped blocks. If the residuals ¢€;; are homoscedastic, 
then subject to the constraint (6.155), the best intra-block estimates of 
the treatment effects are given by (6.16) or (6.19) and the formula 
(6.195) remains valid. In the intra-block analysis of variance, the block 
sum of squares can be split up into two components, and the analysis 
can be exhibited in the form given in Table VII A. There is however no 
change in the F-test. R, stands for the sum of the block totals in the 
cth replication group. 

The best estimate of the treatment effects, combining both intra- and 
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inter-block information is given by (6.26) or (6.27), and the formula 
(6.29) remains valid, if o? and o;? are now estimated by equating S,? 
and S}..? to their expectations in the auxiliary Table VII B. As before 
items have to be transferred from the main table VII A. 


TABLE VII A 


INTRA-BLOCK ANALYSIS OF VARIANCE (WITH GROUPED BLOCKS) 



































ra Degrees of 
Source of variation Fee ee Sum of squares Mean square 
v 
Treatments (adjusted) v—-l Sii= Zz t:Q; 8,2 = S;2/(v—1) 
i=1 
vv 
= R22 @? 
Replication groups r’—1 S;?= z—-— 
cl nv N 
Blocks within replication b By ” Re 
groups (unadjusted) b—r’ Sow’? = 2—-2s— 
jimi & cm nv 
Error N—b-—v+1 S,? (by subtraction) 8? = S,2/(N —b —v +1) 
Gg? 
Total N-1 2z wij? -— — PF =8;2/s¢2 
i,j N 
TABLE VII B 


AUXILIARY TABLE FOR INTER-BLOCK ANALYSIS OF VARIANCE 
(WITH GROUPED BLOCKS) 























— Degrees of Expectation of the sum 
Source of variation nett Sum of squares of emnene 
° —2 @ 
Treatments (unadjusted) v-—l = —-— 
i-1 7 N 
Replication groups r’—l S,? (by transference) 
Blocks within replication 
groups (adjusted) b—r’ Siw? (by subtraction) | (b—r’)o2?+(N —v—kr’ +k)os? 
Error N-b-v+1 S,? (by transference) (N —b —v+1)a? 
G2 
Total N-1 2 wij? - — 
ii N 














4. Finally we shall consider the case when we have replication groups 


orthogonal to the blocks. Thus the first a positions within the block 
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give a group with n replications, the next a positions do the same an 
so on. There are r’ replication groups where k=ar’, r=nr’. For example 
in the design no. 9 of Table I A and B, r=5, n=1, a=1, and in the de- 
sign no. 1 of Table III A and B, r=3, n=1, a=2. Such designs may be 
used for elimination of heterogeneity in two ways (i.e., between blocks, 
and between replication groups orthogonal to the blocks). Classic ex- 
amples are the Latin square [17], and Youden square designs [18]. 
Another example will be discussed in section VII. In randomizing 
within blocks, replication groups should be left undisturbed. Suppose 


TABLE VIII A 


INTRA-BLOCK ANALYSIS OF VARIANCE (WITH REPLICATION GROUPS 
ORTHOGONAL TO BLOCKS) 























7 Degrees of 
Source of variation Prana Sum of squares Mean square 
v 
Treatments (adjusted) v-—1 Se= 2 4:Q; 8:2 =S;,2/(v —1) 
i=1 
: B;? q 
Blocks (unadjusted) b-1 St= > —- -— 
jo ik N 
R22 @G@ 
Replication groups r’—1 S;? =) — -— 
nv N 
Error N—b—v—r’'+2)| 8,2 (by subtraction) 8-2 = S,2/(N —b —v —1’ +2) 
G? 
Total N-1 Zz wij? - — F =8;2/a¢? 
ii N 














the ith treatment occurs in the set which has been assigned to the jth 
block, and within this block has been applied to an experimental unit 
in the cth replication group, giving the observation y;;. The appropriate 
model in this case is (assuming the absence of interaction between 
blocks and treatments), 


(6.40) Ys =Qttt pe toyt a; 


so that the observation is considered to be the sum of (i) a general com- 
ponent g, (ii) a component ¢;, the effect of the ith treatment, (ili) a 
component p,, the effect of the cth replication group, (iv) a component 
b;, the effect of the jth block, (v) a random component ¢;; with expecta- 
tion zero, the ¢;;’s being independently distributed. 
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If the residuals e;; are homoscedastic, the best intra-block estimates 
of the treatment effects are given by (6.16) or (6.19), and the variance 
of the estimated difference of two treatments is given by (6.195). The 
analysis of variance is now given by Table VIII A, where R, denotes 
the sum of the observations for the cth replication group. 

The formulae (6.26), (6.28), and (6.29) for best estimates combining 
intra- and inter-block information remain valid, provided that o? and 
o;? are estimated by equating S,? and S,? to their expectations in the 
auxiliary Table VIII B. 

TABLE VIII B 


AUXILIARY TABLE FOR INTER-BLOCK ANALYSIS OF VARIANCE 
(WITH REPLICATION GROUPS ORTHOGONAL TO BLOCKS) 























Source of variation Degrees of Sum of squares Expectation of the sum 
freedom of squares 
T? @ 
Treatments (unadjusted) v-1 S/2=2 —— N 
rT 
Blocks (adjusted) b-1 Sp? (by subtraction) (b—1)o2+(N —v)o:? 
Replication groups r’—1 S,? (by transference) 
Error N—b—v-—r’'+2)| S,? (by transference) (N —b —v —r’ +2)o? 
G? 
Total N-1 Zz vii? - — 
i,j N 














VII. COMPUTATIONAL PROCEDURE FOR ANALYZING PARTIALLY 
BALANCED DESIGNS WITH TWO ASSOCIATE CLASSES 


1. The values of the constants c;, cz, H, and A have been provided for 
all designs whose parameters have been given in Tables I-V. Thus the 
solutions of normal equations for both the intra-block and the com- 
bined intra- and the inter-block analysis can be written down directly. 
The main labor of solving these equations lies in the evaluation of the 
quantities S,(Q;), Si(P;) from the actual observations. This calculation 
can be systematized by using the association schemes already dis- 
cussed. We shall in the next paragraph illustrate in detail the computa- 
tional procedure with reference to a design of the triangular type. The 
necessary modifications for other types are indicated later. 

2. The observations have been obtained by superposing the design 
on a part of a blank experiment planted uniformly to Mexican 87 
variety of cotton, located on the Upper Coastal Plain Branch Experi- 
ment Station, Rocky Mount, North Carolina. This cotton was grown 
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during 1941. The stand was fairly uniform and all rows received the 
same fertilizer and cultural treatments. Thirty-six rows, 300 feet long 
were harvested in sections 25 feet in length. A cord was stretched across 
the field perpendicular to the rows at intervals of 25 feet. The cotton 
from each 25 foot plot was harvested, weighed, and the weight recorded 
separately. These weights varied from 0.93 to 3.25 pounds per plot, 
with a mean of 2.21+.20 pounds. A total of 432 plots were harvested 
in this manner. 

The particular design selected for the purpose of illustration is the 
design no. 4 of Tables II A and II B used as a two-way design. The 
columns of the plan in Table II B (representing the blocks) were 
randomized, and then the rows (representing the replications ortho- 
gonal to the blocks) were randomized. The plan as it emerged after 
randomization was superposed on a randomly selected part of the 
blank experiment described above, the blocks being four consecutive 
plots of the same row, and the replications running perpendicular to 
the rows over ten rows. The resulting field plan and the yields are given 
in Table IX A together with the block and replication totals and 
means. 


TABLE IX A 
FIELD PLAN, YIELDS, BLOCK AND REPLICATION TOTALS, AND MEANS 








Block Treatment and yield Block total Block mean 








1 t 2.54 c 2.89 g 2.72 e 2.28 10.43 2.6075 
2 g 2.29 b 2.51 Sf i. j 1.54 8.11 2.0275 
3 ce 2.29 i 2.44 a 2.31 f 2.49 9.53 2.3385 
4 f 2.31 d 2.54 e 2.81 h 2.81 10.47 2.6175 
5 h 2.99 a 2.86 j 2.87 t 2.23 10.95 2.7375 
6 a 1.65 h 2.28 d 2.09 g 1.57 7.59 1.8975 
7 d 2.36 f 3.02 t 2.12 b 1.41 8.91 2.2275 
8 b 1.90 g 2.60 h 2.44 ec 1.95 8.89 2.2225 
9 e 2.20 j 2.77 b 3.06 a 2.58 10.61 2.6525 
10 j 2.09 e 2.07 c 2.03 d 2.03 8.22 2.0550 
Replication 
total 22.62 25.98 24.22 20.89 G=93.71 
Replication 
mean 2.262 2.598 2.422 2.089 G/N =2.34275 











The adjusted treatment totals Q, and the quantities Q’ (for inter- 
block analysis) for each treatment can be calculated by placing the 
yields corresponding to any treatment in a row, and the corresponding 
block averages in a second row. Then the total of the first row gives 7’, 
the total of the second row gives 7’—Q, and the difference of these 
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1.57 


2.60 


9.18 =T,, 
8.755=T,—Q,, Q,’= —0.616 


Q 


totals gives Q. Also (T—Q)—rG/N gives Q’. The calculation for the 
treatment g is given below for illustration. 


2.29 
2.6075 2.0275 1.8975 2.2225 


= 0.425 











The quantities 7, Q are entered for each treatment in the Table 
IX B and the quantities Q’ in the Table IX G. A check is afforded by 
remembering that 

































(7.20) ~—T=G4, DYeQ=90, DQ=0. 
TABLE IX B 
- Adjusted 
Treatment yg Q S:(Q) t treatment 
mean 
a 9.40 —0.2700 0.0525 —0.07917 2.26 
b 8.88 —0.2500 —0.3075 —0.06539 2.28 
c 9.16 —0.1075 0.8800 —0.05061 2.29 
d 9.02 0.2225 —1.0300 0.08717 2.43 
e 9.36 —0.5725 0.6600 -—0.18006 2.16 
f 9.59 0.3350 0.3175 0.08972 2.43 
g 9.18 0.4250 —1.1125 0.14750 2.49 
h 10.52 1.0450 —1.4225 0.33350 2.68 
t 9.33 —0.6250 0.6675 —0.19539 2.15 
I 9.27 —0.2025 1.2950 —0.08728 2.26 








We can now calculate the quantities S,(Q) occurring in the equations 





(6.19) for intra-block analysis by using the association scheme. The 
adjusted treatment total Q for any treatment is substituted in the as- 
sociation scheme, and the row and column totals are calculated. We 
thus have 














TABLE IX C 

* a b c d Row total 
—0.2700 —0.2500 —0.1075 0.2225 —0.4050 

a . e f 9 
—0.2700 —0.572h 0.3350 0.4250 —0.0825 

b e * h i 
—0.2500 —0.5725 1.0450 —0.6250 —0.4025 

c f h ¥ j 
—0.1075 0.3350 1.0450 —Q.2025 1.0700 

d g i j ’ 
0.2225 0.4250 —0.6250 —0.2025 * —0.1800 








Column total —0.0825 
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A check on calculations is afforded by remembering that the row 
totals or column totals should add up to zero. Taking any treatment 
we may denote by S’(Q) the sum of the row totals for the two rows in 
which the treatment appears. From symmetry, however, we can pick 
up the treatment in any one position and find the sum of the corre- 
sponding row and column total to get S’(Q). Then 


(7.21) 8(Q) = 8’(Q) — 2Q. 


For example, 
S:(Q,) = — 0.0825 — 0.1800 — 2(0.4250) 
= — 1.1125. 


The quantities S:(Q) calculated from (7.21) are entered in the Table 
IX B. The intra-block estimates / are then calculated by using (6.19). 
Since for the design under consideration c; =4/15, co=8/15, r=4, k=4, 
we have 


er 
~ 45 — 


For example Q, = 0.4250, S:(Q,) = — 1.1125, 4, =0.14750, and the cor- 
responding adjusted treatment mean (obtained by adding the grand 
mean G/N to it) is 2.49. 

As a check it should be remembered that 


(7.22) > S(Q)=0, DLi=0. 


Also r times the sum of the adjusted treatment means should be G. 
Thus the total of every column in Table IX B is predetermined. 

The intra-block analysis of variance follows Table VIII A. All the 
quantities necessary for forming this are available in Tables IX A and 
IX B. Hence the intra-block analysis is as shown below. 


TABLE IX D 
INTRA-BLOCK ANALYSIS OF VARIANCE 











Source of variation Degrees of freedom Sum of squares Mean square 
Treatments (adjusted) 9 0.7467 0.0830 
Blocks (unadjusted) 9 3.2284 
Replications (unadjusted) 3 1.4234 

Error 18 1.6351 0.09084 





Total 39 7.0336 
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The ratio of the treatment mean square to the error mean square 
should obey the F-distribution with df. 9, 18. Here F <1, and is insig- 
nificant as we should expect since the data have been taken from a uni- 
formity trial, and the two mean squares estimate the same variance. 

The estimate of the variance of the intra-block estimate of the differ- 
ence between two treatments is given by 


2827 k-— Gy 28,7 k — Ce 
. or . 
r k-1 r k-1 


(7.23) 


, 


according as the treatments are first associates or second associates. 
Here c,=4/15, co=8/15. Hence this estimate is .0565 or .0525 and its 
square root .238 or .229 is the quantity appearing in the denominator 
of the t-statistic (18 df.) to test for the significance of the difference 
between two treatments. 

To make the inter-block analysis, we need the estimates of the 
weights w and w’. This is done by setting up the auxiliary table corre- 
sponding to VIII B. All the quantities for making up this table are 
already available. 


TABLE IX E 
AUXILIARY TABLE FOR INTER-BLOCK ANALYSIS OF VARIANCE 








Expectation of the 


Source of variation Degrees of freedom| Sum of squares sum of squares 





Treatments (unadjusted) 0.4566 
Blocks (adjusted) 3.5185 90? +300;? 
Replications 1.4234 

Error 1.6351 18¢:2 





Total 7.0336 














The estimate of o? is s,2=.09084 (the same as the mean square for 
error in Table IX D), and the estimate of o;? is s;2=.09003 obtained 
from 


9s,? + 30s)? = 3.5185. 


The estimates of w and w’ are given by 1/s,” and 1/(s,2+-4s,”) since 
k=4, and are therefore 


w = 11.0084, w’ = 2.2175. 
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We can now compute the quantities P for each treatment from the 
relation 


P = 11.0084Q + 2.2175Q’, 


and enter these in the Table IX F (the values of Q being taken from 
IX B). The quantities S,(P) now may be computed in the same manner 
as S,(Q) by entering the values of P in the association scheme and find- 
ing the row and column totals. This is done in Table IX F. 


TABLE IX F 








a b c Row total 
—2.3093 —3.2865 —1.4129 —5.8310 
* e f 
—5.0572 3.4306 —0.6233 


e * h 
—5.0572 11.7343 


° h . j 
3.4306 11.7343 11.7479 


g i i 
3.3126 —5.5852 —2.0041 —3.0990 




















Column total —0.6233 —2.1946 11.7479 





The row totals and the column totals should add up to zero. Taking 
any treatment let S’(P) be the sum of the totals for the row and column 
in which it appears. Then 


(7.24) Si(P) = S'(P) — 2P. 


The quantities S,(P) are entered in the Table IX F. The total of the 
column giving S,(P) should vanish. 

Now d, and d: are calculated by using (6.265), remembering that for 
this design c,=4/15, c.=8/15, A=45/4, and H =27/4 (cf. Table IT A). 
Hence we get 

d, = .2101, dz = .4202. 


Using the equation (6.28) we see that the combined intra- and inter- 
block estimates {’ can be computed from 


i’ = .02539P — .00149S,(P). 


These values are entered in Table IX G for each treatment, and finally 
the adjusted means, are obtained by adding to ?’ the grand mean G/N. 
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TABLE IX G 

Adjusted 

Treatment Y P S.(P) ¢ treatment 
mean 
a 0.2990 —2.3093 —1.8357 —0.05589 2.29 
b —0.2410 —3.2865 —1.4526 —0.08128 2.26 
c —0.1035 —1.4129 8.7427 —0.04890 2.29 
d —0.5735 1.1777 —11.2854 0.04672 2.39 
e 0.5615 —5.0572 7.2965 —0.13927 2.20 
tf —0.1160 3.4306 4.2634 0.08075 2.42 
g —0.6160 3.3126 —10.3475 0.09953 2.44 
h 0.1040 11.7343 —13.9153 0.31866 2.66 
t 0.5840 —5.5852 5.8768 —0.15057 2.19 
j 0.1015 —2.0041 12.6571 —0.06974 2.27 





Using (6.29) we find that the variance of the estimated difference 
between two treatments (using both intra- and inter-block information) 
is .0538 or .0507 according as the treatments are Ist associates or 2nd 
associates. 

There is no exact test of significance but an approximate test is ob- 
tained by testing > i’P as a x? with v—1 degrees of freedom. Here 


> ’P = 6.551 


so that with 9 df., we have .50< p(x?) <.70. x? is obviously insignifi- 
cant, as it should be. 

3. The only modification necessary for other types of partially bal- 
anced designs with two associate classes, is in the calculation of the 
quantities S,(Q) and S;(P). We shall discuss this for each type sepa- 
rately. 

(a) For the group divisible type any two treatments occurring in the 
same column of the association scheme are first associates. The Table 
IX C is now formed by substituting the adjusted treatment total Q for 
any treatment. in the association scheme, and calculating the column 
totals. Taking any treatment we may denote by G(Q) the total of the 
column in which the treatment appears. Then 


(7.30) S:(Q) = GQ) - Q. 


Analogously the Table IX F is formed by substituting the quantities 
P in the association scheme. If G(P) is the total of the column.in which 
a particular treatment appears, then 


(7.31) Si(P) = G(P) — P. 


(b) For the Latin square type we form the Table [X C by substitut- 
ing the adjusted treatment totals Q in the association scheme. When 
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i=2, ie. there are two constraints, let R(Q) and C(Q) denote the row 
total and column total for any particular treatment. When 7=3, let 
L(Q) further denote the sum of the Q’s for all treatments corresponding 
to the same letter as the given treatment. Then 


(7.32) Si(Q) = R(Q) + C(Q) — 2Q when i = 2, 
(7.33) Si(Q) = R(Q) + C(Q) + L(Q) —3Q.~—s when = 3. 


S,(P) may be calculated in a similar manner. 

(c) For the singly linked block type, the association scheme is the 
same as the plan. Table IX C is formed by substituting the adjusted 
treatment totals Q in the association scheme, and obtaining the column 
totals. Let B(Q) be the sum of totals for the r columns in which a treat- 
ment appears. Then 


(7.34) Si(Q) = BQ) — rQ. 


S:(P) may be calculated in a similar manner. 

(d) The calculation of S,(Q) and S,(P) for the cyclic type will be 
illustrated by taking an example. Consider the design no. 1 of Tables 
V A and B. The complete association scheme is obtained by setting out 
the d’s in the first column, and getting the other columns by adding the 
numbers 1, 2, - - - , 12 and reducing mod 13 since v= 13. The complete 
association scheme is thus: 














Column No. 

(1) | (2) (3) | (4) (5) | (6) (7) (8) (9) (10) | (11) | (12) | (13) 
2 3 4 | 5 6 7 8 9 19 ll 12 13 1 
5 6 7 8 9 10 ll 12 13 1 2 3 4 
6 7 8 9 10 ll 12 13 1 2 3 4 5 
7 8 9 10 11 12 13 1 2 3 4 5 6 
8 9 10 ll 12 13 1 2 3 4 5 6 7 
11 12 13 1 2 3 4 5 6 7 8 9 10 


























The first associates of the treatment 7 now occur in the corresponding 
column (i) of the association scheme. The Table IX C is now formed 
by substituting the adjusted treatment totals Q in the association 
scheme. S,(Q) for any particular treatment is obtained by taking the 
total of the corresponding column. S,(P) may be calculated in a similar 
manner. 
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REPLACING VARIABLES IN CORRELATION PROBLEMS 


Vincent I, West 
University of Illinois 


HE importance of using matrix-inverting methods in solving normal 
eat has often been emphasized. Computation of the matrix 
inverse to the complete correlation matrix or to the matrix of correla- 
tions among the independent variables is usually the easiest way to 
evaluate the standard errors of the regression constants. For special 
problems the complete inverse may not be needed [10, page 456]. Be- 
cause of the need to compute the standard errors, and the frequent 
desirability of working later with slightly altered sets of variables, how- 
ever, it would seem that the inversion of the matrix of correlations (or 
covariances) should be the initial step in solving most problems involv- 
ing normal equations [5]. 

After solving the normal equations of correlation problems one may 
wish to delete a variable, to replace a variable, or to add another 
variable. In some situations these operations may be performed most 
efficiently by making the desired deletions, substitutions, or additions 
and then re-solving the normal equations. In other situations the de- 
sired changes may be effected more efficiently by operations on the in- 
verse matrix already computed, without directly re-solving the normal 
equations. One of the advantages of using matrix-inverting methods is 
the possibility of utilizing the inverse in deriving solutions for altered 
sets of variables. 

The purpose of this paper is to describe one method which may be 
useful in situations where the direct re-solving of the normal equations 
would be laborious. The solution will first be indicated by the problem 
of replacing the fourth variable in a five-variable problem. The problem 
will then be generalized to the replacing of the kth variable in an 
n-variable problem. In discussing the five-variable problem the matrix 
inverse to the correlation matrix will be called the “P table” so that this 
discussion may be more easily understood by computers who are ac- 
customed to the “Waugh Method” [1]. 


COMPUTATIONS TO REPLACE THE FOURTH VARIABLE 
IN A FIVE-VARIABLE PROBLEM 


If a decision to replace the fourth variable with another is made after 
solving a five-variable problem, the P table already calculated may be 
used to get the P table in the five-variable set excluding X, but includ- 
ing the new variable (say X¢). 


185 
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Step 1. Delete the fourth variable from the P table using either of these 
equivalent formulas.' 


PisPu — PaPsj 
Pi jy, = P ) or 
44 


Pis 
Pina = Pig — Pu(=*) , 
44 


Step 2. Compute the following constants which are to be used in step 
3 to get the elements of the desired P table. 


Kye = Puyadris + Pizyacres + Pisyactss + Pisyait'ss 
Ko = P24 (716 + Po2)4 (726 + Posts + P54 ('s6 
Kye = Pisyactie + Pesyacree + Pesyacrss + P54 c'se 
Kse = Pyare + P54 (728 + P35)4 (36 + Pesjacse 


Step 3. Compute the new P table. The elements of this table will be 
indicated by P;; just as they were in the old table. In computing 
work this will cause no confusion since the stub and caption of the 
P table will indicate the variables involved. 





1 P33= P33)4,— PseK 36 
Pe 





"1 Ku Keta~Keta—Ketu Po3= Po3)4,— PooK 3 
P= — PeKse P13= P43)4,— PisK 36 
P= — PeeKse 
Pog = — PesKee 
Pys= — PoKic 


P22 = P224(— PooK2e 
P12 = P12)4.— PisKeos 
Fic Pear ~Pkes Py = Pry — PKs 
P35= P35)4,— PoeKs6 
Pos = P25)4,— PoeKse 
Py5= P54, — PisKs6 
The table presents a numerical example. Lines 1 to 5 give the corre- 


lation coefficients among the variables. In the present application it is 
assumed that the matrix inverse to the matrix of the first five columns 





1 This formula is implicitly in [1] and is derived explicitly in [2] except that Waugh used P,j)x/ 
=R;; ./R rather than P;;)k( =Rij)kc/Rkk as above. The latter seems preferable since they are the same 
function of the reduced set as the P;; are of the original set. 
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had already been computed. This is presented on lines 6-10 of the 
table. This matrix is presented in its entirety, even though symmetric, 
for convenience in deleting the X, column of the inverse (step 1). 


CompvuTaTions TO RepLace X. BY X¢ IN THE INVERSE OF A FIVE-VARIABLE 
CoRRELATION MATRIX 








Xs 





-03 
-16 
— .33 
1.00 





-09901 
— .02756 
-54300 


Kis 





-51765 . -04131 
- 90992 . -49542 
.32074 . - 15639 
-67830 . — .69961 





-69961 (.65782) 





.56159 —.06280 
. 143681 —.75312 
-48707 - 23774 


.42236 1.06353 
1.52017 





The inverse of the matrix of correlation coefficients in the four- 
variable set which excludes X, is entered on lines 11-14. To the left 
of this inverse are three columns presenting rie, Kis, and Ps. The latter 
are elements of the desired inverse but are reproduced here for con- 
venience in computing. 

The Kj are computed according to step 2. Multiplying the elements 
of the first column by the corresponding, elements of the rs column 
and summing gives Ky. in one machine operation. 


Ky = 1.17738(— .19) + .39546(.14) + .15524(.05) 
— .51765(— .39) = .04131. 


Kx, Kye, and Kss are computed by similar multiplication in the sec- 
ond, third, and fifth columns, respectively. Subtracting the sum of the 
products riK :¢ from unity gives the number in parentheses at the bot- 
tom of the Kj. column. The reciprocal of this number is Pes recorded 
just to the right in the Ps column. 
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The remaining P;.are computed by multiplying Pes by — Kis. Each 
. remaining element of the desired inverse is the corresponding entry on 
lines 11-14 reduced by the product of the Ke; (on line 15 below that 
entry) and P (on the extreme right). 

For example: 


P3 = — .32074 — (— .69961)(— .23774) = — .48707. 


It should, perhaps, be pointed out that the use of the inverse of a 
submatrix is not a new idea. Hotelling [4] presented a method for in- 
verting partitioned matrices in 1943. Waugh [3] improved this method 
in 1945. The improved method provides basis for the technique de- 
scribed in this paper. In 1946 a paper by Louis Guttman [8] used this 
method for inverting matrices by the inversion of submatrices of suc- 
cessively higher order. Guttman attributes the technique to Fraser, 
Duncan, and Collar [9]. 

The present paper is intended primarily to present a computing 
technique, although it may also give wider currency to the knowledge 
that an inverse can be altered in this way.’ 

Waugh [3] calculated the inverse of a partitioned matrix as 


ab\"' /A 
(’ 2 " * o 
D = (d — ca™b)“ B= — Dea" 
C= a "dD A=a'!—a "0B 
In the problem of replacing the kth variable by the (n+1)st we may 
first delete the kth variable by the formula 
Pi;Pix — PaPr; 
Pr 





Pinn = 


Ti2 es Oe 


T21 T22 “**3s-a Paees 


[Pied] = To—1,1 Ta-19 °° ° Ta-1,d—1 Th-2,b41 °° * Ta—1.8 


Teil 7k41,2 °° * Thei,k—1 7k41,k41 °° * Tkti,n 








LL. Tn Tn2 “**9epeg Tht °° * Tae ad 





2 The equations for these operations are given in matrix form in [11] Chapter 16 and 17. 
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Adding the (n+1)st variable (to replace the kth) gives the following 
matrix to be inverted 


Tu ri -**Me28 Vise “** a *  Tijn+1 7 


T21 T22 "°° seg That °** hw *  T2,n+1 


Temi. Th-1,2 °° ° Th—1,k—-1 Te-1,k41 °° ° Tk-tn °° Tk—1,n+1 


Te-1,1 Th41,2 °° ° Th41,k—-1 The1k4 °° * Pepin * | Tk4+1,n41 


Tr Tn2 © * Te bad Tn k+1 o* * Tie ° Tra,nt1 








LL nt Pngt2 °° * Tn¢tk—-1 Tnt¢tkt¢i ’ °° Tntisn * Tatintl 4 
where P is of the order n but contains the (n+1)st variable. The 
method of partitioning makes possible the following substitutions in 
Waugh’s formulas. 

1. d is the scalar rasa, ngi=l 

2. b is the column vector {rj, ny1}(i=1,2---k—-1,k+1,---,n),¢€ 
is its transpose, and 

3. {a-'b} is the transpose of [ca-"] 

4. a'=[P3i;,ed 

COMPUTATIONS 

First compute {a-'b} =[ca-"]’ as an aid in giving computing instruc- 

tions this vector may be called { Kj, n4:} 


Ki n4t = px Payer nti 
h 
The elements of the desired inverse may then be calculated 


D = Pasi.ngi = 1/(1 — Zz rastaK rast) 
h 


C= { ~~ PassngtKintt} 
B= C’ 
[Pisvec oe Kin4iPas1,5| 14,j=1, 2°", k-1, k+1,---,”. 
The formulas which give the inverse of a partitioned matrix are much 
more general than this application. They are applicable to other sym- 


metric matrices, such as the covariance or product—sum matrices 
sometimes used in solving normal equations. Extension to nonsym- 
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metric matrices, and to matrices which differ in more than one row and 
column are effected respectively by introducing separate vectors c and 
b into the problem and by interpreting each of these as a matrix rather 
than a vector. In the latter case the new square matrix would no longer 
be scalar. The method of Woodbury [7] may have particular advantages 
for nonsymmetric cases. 
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OPERATING CHARACTERISTICS FOR TESTS OF THE 
STABILITY OF A NORMAL POPULATION 


Joun E. WatsH 
Naval Ordnance Test Station* 


This paper presents values of the operating characteristic 
(OC) function for a common type of stability test and for two 
possible substitutes for this test. The situation considered is 
that of smal] samples from normal populations. The purpose 
of a test is for deciding whether a sample is from a specified 
population. 


HE common type of test investigated is based on limits for the 
yp we mean and standard deviation (using n—1). One test is 
based on limits for the t-statistic and the sample standard deviation 
and another on limits for the sample mean and the estimate of the 
population standard deviation obtained by using the mean of the speci- 
fied population (which determined the limits). Each of the three tests is 
found to have regions where its operating characteristics are poor. No 
one test has uniformly better operating characteristics than the others. 
On the whole, the tests based on the ¢-statistic and the sample standard 
deviation appear to be inferior to the other two, which are roughly 
equivalent. An extensive OC function analysis is presented for the com- 


mon type of test. One possible application of the results of this paper 
is in quality control work. 


INTRODUCTION 


In this paper, all statistical populations involved are assumed to be 
normal. The problem is to test whether a small sample came from a 
specified population. This is done by computing the values of two sta- 
tistics and observing whether both fall within limits determined by the 
specified population. The normal population used to determine the 
limits has known mean m and known standard deviation ¢. The normal 
population from which the sample was drawn has unknown mean m’ 
and unknown standard deviation o’. The nuli hypothesis tested is that 
both m’ =m and o’ =c. Repeated use of this test procedure furnishes a 
method of investigating the stability of a normal population. 

The results of this paper are directly applicable to quality control 
situations where the assumptions underlying the tests can be con- 





* The tables contained in this paper were computed while the author was with The Rand Corpora- 
tion. 
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sidered to hold. When a new sample is obtained, the values of the two 
statistics can be plotted on control charts. This procedure furnishes a 
significance test for the population from which the new sample was 
drawn. If one or both of the plotted statistics falls outside its control 
limits, the population for the new sample is “significantly different” 
from the population determining the control limits on the charts. 

Let the n values of the sample be denoted by 2, - : - , tn. The com- 
mon type of test is based on limits for each of the statistics! 





== > x;/n, 8 =4/ . (x; — £)?/(n — 1). 


In this investigation, the limits chosen for % and s will not be the “3- 
sigma” ones. Instead, these limits will be selected so as to obtain certain 
specified significance levels and at the same time simplify computations 
and comparisons. The procedure followed is to choose the limits so 
that the probability of exceeding the upper limit equals the probability 
of being less than the lower limit equals a common value £ for each of 
€ and s when the null hypothesis holds. The limits are found by deter- 
mining the value of 6 which yields a specified significance level a for the 
test. The significance levels considered for this common type of test 
are a=.001, .005, .01, .02, .05 while the sample sizes are n=3, 5, 7. 

Although Z is an estimate of m’, the probability of Z lying outside its 
limits depends not only on how m’ compares with m but also on the 
values of o’ and o. Thus an unusual value of # cannot necessarily be 
attributed to a difference between m’ and m. Consequently, the < sta- 
tistic does not furnish a strict measure of how m’ compares with m. 
However, let Z be replaced by the ¢-statistic 


t = ./n (z — m)/s, 


so that limits for ¢ and s are used. The ¢-statistic furnishes a strict 
measure of how m’ compares with m (s furnishes a strict measure of 
how o’ compares with c). This is the motivation for considering tests 
based on ¢ and s. The limits are chosen so that separately, for each of 
t and s, the probability of exceeding the upper limit equals the proba- 
bility of being less than the lower limit equals a common value y when 
the null hypothesis is satisfied. The limits are determined by finding the 
value of y which yields a specified significance level a. Table 1 contains 





1 Exactly the same operating characteristics would be obtained if n were used in place of n—1 in 
the definition of s. However, the values of the limits would be changed. 
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values of the limits for a=.005, .01, .02, .03, .04, .05 and n=3, 5. For 
given a and n, this test is seen to be directly comparable to the corre- 
sponding test based on & and s. Due to the computational difficulties 
involved, the only cases where the tests will be compared are for a=.01 
and n=8, 5. 

When the null hypothesis is satisfied, m’ =m and the statistic 





s(m) = / x in aie 


is an estimate of o’ which is more efficient than s. Hence, it appears that 


TABLE 1 
LIMITS FOR t AND s AT SEVERAL VALUES OF a 








n=3 





Limits for s 
Limits for t Limits for t 
Upper Lower 











+ 18.28 2.5550 -0387¢ +6.571 
+12.90 2.4160 -0547¢ +5.435 
+ 9.077 2.2670 -0775¢ +4.461 
+ 7.376 2.175¢ -09500 +3.954 
+ 6.357 2.1080 -1098¢ +3.619 
+ 5.659 2.0540 - 12280 +3.371 























tests using Z and s(m) might be preferable to those based on # and s or 
on t and s. This is the motivation for investigating tests based on # and 
s(m). The limits are selected so that separately, for each of Z and s(m), 
the probability of exceeding the upper limit equals the probability of 
being less than the lower limit equals a common value 6 when the null 
hypothesis holds. The limits are determined by finding the value of 6 
which yields a specified significance level a for the test. Table 2 con- 
tains values of the limits for a=.005, .01, .02, .03, .04, .05 and n=3, 5. 
For specified a and n, this test is directly comparable to the correspond- 
ing test based on Z and s, also the corresponding test based on ¢ and s. 
The only cases where the three types of tests will be compared are 
a=.01 and n=3, 5. 

It is well known that the t-test for the population mean is less efficient 
than the test based on # and known variance. Since ¢ is not statistically 
independent of s, however, this efficiency relation does not necessarily 
imply that, for fixed significance level, the tests based on # and s have 
better operating characteristics than those based on ¢ and s. Likewise, 
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TABLE 2 
LIMITS FOR # AND sa(m) AT SEVERAL VALUES OF a 




















n=3 n=5 

a Limits for s(m) Limits for s(m) 
Limits for 2 Limits for 2 

Upper Lower Upper Lower 
.005 m+1.733¢ 2.283¢ .0994¢ m+1.346¢ 1.995¢ -2166¢ 
-01 m+1.606¢ 2.1720 ~=S .1258¢ m+1.248¢ 1.9lle - 25060 
.02 m+1.470¢ 2.054¢ | .1593¢ m+1.143¢ 1.823¢ -2904¢ 
.03 m+1.384¢ 1.983¢ .1831¢ m+1.078¢ 1.769¢ .3171le 
.04 m+1.322¢ 1.927¢ .2022¢ m+1.030¢ 1.729¢ .3377¢ 
05 m+1.272¢ 1.8840 .2184¢ m+ .99le 1.697¢ -3548¢ 























since Z and s(m) are not statistically independent, the tests based on Z 
and s(m) are not necessarily preferable to those based on Z and s; even 
though s(m) is a superior estimate of the population standard deviation 
when the null hypothesis is true. Thus investigation of all three types 
of tests appears to be called for. 


DISCUSSION AND RESULTS 


First let us consider the definition and interpretation of the OC func- 
tion for the tests considered. The OC function for a test is defined to be 
the probability that both statistics have values within their limits given 
the true values of m’ and o’. When one or both of m’ and o’ differ from 
m and ao, respectively, the sample did not come from the population 
used in determining the limits. The purpose of the test is to detect this 
fact. Thus (same significance level) when m’ #m and/or o’ ¥<, the pref- 
erable test is the one which detects this lack of agreement with the 
highest probability. Consequently, given m’ and o’, the test to be pre- 
ferred is the one whose OC function is smallest. If the value of the OC 
function is small for specified m’ and o’, the probability that this devia- 
tion from m and o will be detected is high. On the other hand, when the 
OC function is large, the probability of detecting the deviation is small. 
Thus the OC function also represents a means of studying the ability of 
a test to detect different deviations from the hypothetical situation of 
m'=m and o’=c. 

This paper presents values of the OC function for a particular set of 
alternative hypotheses (i.e., values of m’ and o’) which appear to be 
representative of those of major interest. In this presentation m’ and 
o’ are expressed in terms of two equivalent parameters a and b, where 


a = V/n(m — m’)/o’, b = aja’. 








an—_ = 


weTClllrh TP lUDEEllClUW™—“‘i‘i hAH 
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The quantities a and b are more suitable for presentation of the results 
and arise naturally in the mathematical derivations. Examination of 
the OC function at the representative values furnishes a measurement 
of the properties of a test and a method of comparing the relative 
worth of two or more tests. 

The analysis is presented in terms of the OC function. An equivalent 
analysis could be made in terms of the power function by use of the 
relation 


Power Function = 1 — OC Function. 


Table 3 contains an OC function comparison of the three types of 
tests for a=.01; n=3, 5; a=0, 1, 2, 3, 4; b=1/8, 1/4, 1/2, 1, 2, 4, 8. 
Since the OC function has the same value for a= —c as for a=c, only 
positive values of a were tabulated. An exceedingly large amount of 
effort was involved in obtaining the OC function values for the tests 
based on t, s and on #, s(m). 

Examination of Table 3 shows that the test based on £ and s(m) is 
uniformly preferable to the other two types for b<1. That is, for b<1 
the OC function values for the tests based on # and s(m) were less than 
or equal to the corresponding values for the other two types of tests at 
all values of a and b considered. However, the values for the tests based 
on £ and s are near those based on ~ and s(m) for b<1. Thus these two 
types of tests are roughly equivalent in this range. The tests based on 
tand s are definitely inferior to the other two types for b<1. For b21, 
no type of test is uniformly preferable. For n=3 and b21, all the OC 
function values are large (even for a=4, b=8). Thus, for n=3, none 
of the tests considered are of much value for detecting deviations in 
this range. For n=5 most of the texts are suitable for b=8 although 
the tests based on #, s(m) become less sensitive as the numerical value 
of a increases. None of the tests has much chance of detecting the devia- 
tion when 6 lies in the range 2 to 4 (for both n=3 and n=5). Thus if o’ 
should happen to be between o/2 and a/4, the tests are all insensitive 
to even very large deviations of m’ from m. 

The over-all consensus seems to be that (for a=.01 and n=3, 5) the 
tests based on #, s and &, s(m) are roughly equivalent to each other and 
on the average superior to those based on t, s. However, the tests based 
on t, s do have the advantage of showing whether a significant result is 
due to a deviation of m’ from m, a deviation of o’ from a, or both. Also 
the t, s test for n=5 appears slightly preferable to the other two tests 
for n=3. Thus approximately the same accuracy can be obtained by 
taking a couple of extra sample values and using the tests based on t, s. 
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TABLE 3 
OC FUNCTION COMPARISON OF THREE TYPES OF TESTS FOR a=.01 


n=3 


















































TABLE 3 (Continued) 


n=5 
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TABLE 4 
OC FUNCTION VALUES FOR TESTS BASED ON 


n=3 








1/2 





-803 ‘ < ° -984 
-706 ‘ ‘ . -923 
-852 
-724 
-443 





-984 
-923 
-852 
-724 
-443 





-984 
-923 
-852 
-724 
-443 





-984 
-923 
-852 
-724 


443 





° -984 
-005 . -975 -980 -923 
‘ -004 . -937 -961 -852 
-0001 -002 d -857 -923 -724 
-0001 -001 ‘ -648 -816 -443 





























The fact that all the tests are insensitive for some of the larger values 
of b suggests that perhaps no attempt should be made to detect large 
values of b by tests of this type. This indicates that the limits for the 
dispersion statistic should be one-sided with the view of detecting small 
values of the parameter b. 

Table 4 contains OC function values of the test based on Z and s for 
a=.001, .005, .01, .02, .05 and n=3, 5, 7. Values of the OC function for 
n=4, 6 and for other values of a in the range .001 to .05 can be obtained 
from these values by interpolation. Thus Table 4 furnishes a basis for a 
rather comprehensive OC function analysis of tests based on Z and s. 

Examination of Table 4 shows that the tests using # and s are very 
suitable for bS1/4 and b28. In the neighborhood of b=2, however, 
this type of test has a “blind spot.” Thus, if the value of o’ should 
happen to be about half that of o, a test based on and s is very in- 
sensitive. 
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TABLE 4 (Continued) 






































n=5 
b 
a@ 1/8 1/4 1/2 1 2 4 8 
a 
.001 ' 004 -090 .688 .999 . 996 .949 .577 
-005 .003 .060 .571 -995 .982 . 804 . 164 
-010 0 -002 -049 .510 -990 -965 .677 .054 
.020 .002 -038 .444 .980 .934 .506 .010 
.050 -001 .026 .347 -950 .854 .250 -000 
.001 .003 .061 .575 - 993 .996 -949 .577 
005 002 040 -453 976 982 .804 164 
010 1 -001 032 .394 .960 .965 .677 054 
.020 -001 .025 .333 -933 .934 .506 .010 
-050 -001 .016 .250 .869 . 853 -250 -000 
.001 .0006 -019 .298 -930 .996 .949 .577 
-005 -0004 O11 -205 .845 -982 . 804 . 164 
-010 2 .0003 .009 - 167 .786 -965 .677 .054 
-020 -0002 .007 .131 .710 -934 .506 .010 
-050 -0001 .004 -089 .579 .848 -250 .000 
-001 .0001 .002 .078 .684 .996 -949 .577 
.005 -0000 .001 .045 .508 .981 -804 . 164 
.010 3 -0000 .001 .033 .421 -961 .677 .054 
.020 .0000 .001 .024 .332 .920 .506 .010 
.050 -0000 -0004 .014 -217 .794 -250 -000 
.001 -0000 -0001 .009 .302 .995 .949 .577 
.005 .0000 -0001 .004 - 164 .962 . 804 . 164 
-010 4 .0000 .0000 .003 -116 .914 .677 .054 
.020 .0000 -0000 -002 .076 .817 .506 -010 
-050 .0000 -0000 -001 .038 -582 -250 .000 

















To obtain a more accurate idea of the OC function properties of the 
tests investigated in this paper, the tabular values should be graphed 
and suitable curves fitted to them. It seems likely that some of these 
tests possess points of great insensitivity which are not sufficiently 
indicated by the tabulations at the particular values of a and b. The 
curves should furnish clews to the location of these points. This graphi- 
cal analysis will not be considered here. 


MATHEMATICAL DERIVATIONS 


This section contains derivations of the OC function expressions. 
The principal property utilized in the analysis is that Z and s are sta- 
tistically independent for samples from normal populations. 

Case of = and s. The upper limit for # is m+ Kgo/+/n while the lower 
limit is m—Kgo/+/n. Also the upper limit for s is xg¢/+/n—1 while the 
lower limit is x’a/+/n—1. The quantity Kg is the standardized normal 








| 1952 


Paeapaiaiins juste 


—aae @O = 
— ane = 
22626 


yhi- 


ns. 
ta- 


ver 
the 
nal 





TESTS OF THE STABILITY OF A NORMAL POPULATION 199 
TABLE 4 (Continued) 






































n=7 
b 
a 1/8 1/4 1/2 1 2 4 8 
a 

.001 -0004 -030 -572 -999 .988 -707 -000 
.005 .0002 .018 -449 -995 .949 .360 .000 
010 0 -0002 014 -389 990 -909 208 -000 
.020 .0001 .010 -327 .980 . 844 .094 .000 
.050 -0001 .006 .243 -950 .702 -018 -000 
.001 .0003 .020 .478 -993 .988 -707 -000 
005 0001 012 .356 076 949 360 .000 
010 1 0001 009 .301 960 909 208 -000 

20 .0001 .007 . 246 .933 . 844 -094 .000 
.050 -0000 -004 .175 . 869 -702 .018 .000 
001 0001 006 .248 930 988 707 .000 
005 0000 003 -161 845 949 360 -000 
010 2 0000 002 .128 786 909 208 .000 
020 0000 002 -097 710 844 094 -000 
050 0000 001 .062 579 698 018 -000 
001 0000 0008 -065 684 988 707 -000 
005 0000 0004 -035 508 948 360 .000 
010 3 | 0000 0003 .026 421 906 208 -000 
020 0000 0002 -018 332 831 094 .000 
050 0000 0001 .010 217 653 018 .000 
.001 .0000 .0000 .007 .302 -986 -707 -000 
-005 .0000 .0000 -003 - 164 .930 .360 -000 
010 4 .0000 .0000 .002 .116 - 861 - 208 -000 
.020 .0000 .0000 .001 .076 -739 -094 .000 
-050 -0000 .0000 -001 .038 -479 .018 .000 




















deviate exceeded with probability 8 while x,” and x,’ are the 1008 per 
cent and 100(1—8) per cent points, respectively, of the x?-distribution 
with n—1 degrees of freedom. That is, 


1- NK) = 8, f* faa = 8, f fealdx? = 6, 
0 Xg 


where 


N(z) = J . e-#/2qg fixe) = 5 La ala 
—e T(r/2) 


Now let us derive the OC function for the test based on £ and s. It is 
to be remembered that the sample considered is from a normal popula- 


tion with mean m’ and standard deviation o’. The null hypothesis 
tested is that both m’=m and o’ =o, where m and o are known. Now 
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OC Function = Pr (no significance| a, b) 
= Pr (m — Kgo/V/n < & < m+ Kgo/V/n, 
xs’o//n —1 <8 < xp0/V/n — 1) 
= Pr (m — Kgo/V/n < & < m+ Kgo/V/n) 
“Pr (xe’0/V/n — 1 <8 < x90/V/n — 1) 
= Pr [a — bKg < Vn (& — m’)/o’ <a + bKgl 
-Pr [b2x’? < (n — 1)s?/a’? < b2x,7] 
b?x," 
= [Na + 0K) — No — BK] J SoaGetit 
Here it is to be noted that »/n(Z—m’)/o’ has a standardized normal 
distribution while (n= 1)s*/a!*=Do(2-2)*/0" has a x*-distribution 


with n—1 degrees of freedom. 

The value of 8 corresponding to a given significance level a is found 
by setting a=0, b=1 and then equating the OC function to 1—a. For 
the case of # and s, the resulting equation reduces to 


i — (1 — 28)? =a. 


The procedure followed in obtaining the OC function values consists in 
first assigning a. Then 8 is determined. Using the value of 8, the OC 
function values are computed from the derived expression. This was 
done for a=0, 1, 2, 3, 4 and b=1/8, 1/4, 1/2, 1, 2, 4, 8. It is to be noted 
that the same OC function value is obtained for a= —c as for a=c so 
that the results also apply to a= —1, —2, —3, —4. 

Case of t and s. The upper limit for ¢ is t, and the lower limit is —t,. 
The upper limit for s is x,o/+/n—1 while the lower limit is x,’o/+/n—1. 
The quantity t, is the deviate of the t-distribution with n—1 degrees of 
freedom which is exceeded with probability y. The quantities x, and 
x7’ have the same definitions as for the case of # and s (with 6 replaced 
by 7). 

As in all cases, the OC function is the probability that both statistics 
lie within their limits given the values of a and b. Thus 
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OC Function = Pr (—t, < t < t,, xy'0/V/n —1 <8 < xy0/V/n — 1) 


t os a es 
= Pr[a- : ee )< a ee 








/n—1 a’ a’ /V/n 
t s/n — 1 s/n — 1 
+ pti), sar <2 ate] 
V/n—1 o’ o’ 


bry 
=2}  [N(a+ tyx/vn — 1) 
br, 
— N(a — tyx/Vn = 1) \xfna(x?)dx. 


It is to be observed that s\/n—1/0’ = V >. (2;— 2)?/o" has a x-distribu- 
tion with n—1 degrees of freedom; i.e., its probability density function 
is 2xfn—1(x’). 

For a given value of a, the value of y is found by setting a=0, b=1 
and equating the resulting OC function expression to 1—a. Using the 
value of y, the OC function is then computed for a=0, 1, 2, 3, 4 and 
b=1/8, 1/4, 1/2, 1, 2, 4, 8. In all cases the method used to obtain the 
OC function values was approximate integration of the integral expres- 
sion. Only a=.01 and n=3, 5 were considered. The same OC function 
values are obtained for a= —c as for a=c. 

Case of & and s(m). The upper limit for < is m+Kso/+/n and the 
lower limit is m—Kyso/+/n. The upper limit for s(m) is xs0/+/n while 
the lower limit is xs’o/+/n. Here the quantities xs’? and x,” are the 
1006 per cent and 100 (1—84) per cent points, respectively, of the x?-dis- 
tribution with n degrees of freedom (not n—1). That is, 





f  pulcdae = 8, f _footdaxt wi 
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The OC function is determined from 


OC Function = Pr [m — Kyo/V/n < & < m+ Kyo/Vn, 
xs'a//n < 8(m) < xs0/Vn] 


= —m’ 
= Pr[a—bKi < —= <a-+t bK,, 
a’//n 


x; — m)? 
b2x5/2 < he ( , ) < vx | 





og’? 


1 — [ B(z) P 
pape ate Sons (xd? e~* "dz, 
/2r a—bKs A(z) 


A(z) = max (0, b?xs’? — (z — a)?] 
B(z) = max [0, b?xs? — (2 — a)?]. 


Given a, set a=0, b=1 in the OC function and equate the resulting 
expression to 1 —a; the value of 6 is determined by this equation. Using 
the resulting value of 5, values of the OC function were computed by 
approximate integration for a=0, 1, 2, 3, 4 and b=1/8, 1/4, 1/2, 1, 2, 
4, 8. Only a=.01 and n=3, 5 were considered. The OC€ function has 
the same value for a= —c as for a=c. 
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MULTIPLE SAMPLING OF ATTRIBUTES 
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THE POPULATION 


three attributes X, Y, and Z. X is an r+1 class attribute, Y ac+1 
class attribute and Z a two class attribute; i.e., the M elements segre- 
gate into 2(r+1)(c+1) mutually exclusive groups with M ;;, elements 
possessing all three properties X;, Y; and Z,. The 2(r+1)(c+1) table 
depicting the population frequencies of the various combinations of 
attributes is then: 


Tc elements of a finite population of size M are characterized by 


TABLE 1 
POPULATION FREQUENCY DISTRIBUTION 








ze eee Re 








Teese F, Yo-:: Y- 





Moow-++ Meco Mo.o Myroo* ++ Mreo M,.o M..o 


Mon::> Moa Mo. Min: ++: Mere M, «1 M.. 

















Tent | Ba. +-+ Bit Be. | Mro.- ++ Mre. + we. 














It is supposed that the property Z is more difficult to measure than 
Y which, in turn, is more difficult to measure than X. 


THE DESIGN AND ESTIMATION PROBLEM 


This population is to be sampled for the purpose of estimating the 
proportion of elements possessing the property Z;. We shall consider 
the sampling design of Table 2 (random sampling without replace- 
ment) with fixed sample sizes N, n, and m and with 

Ns n Mi; Nij 


(1) ney 


N; N m n 
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TABLE 2 


THE MULTIPLE SAMPLING DESIGN 
M elements in the population 





A sample of N elements is drawn and then stratified by X 
No cee N; 





mo elements are drawn n, elements are drawn 
from the No sample ele- from the N, sample ele- 
ments having the prop- ments having the prop- 
erty Xo, and then strati- erty X,, and then strati- 
fied by Y fied by Y 

no Nr 








™Moo of the Noo ™Mro of the Nro 
elts. having Xo elts. having X, 
and Yo are and Yo are 
measured for Z measured for Z 


Moo | Moo00} 001 Mro | Mro0} Mroi 


——! —_ — —_—— 














Moc Of the Noe Mre Of the Mec 
elts. having Xo elts. having X, 
and YY, are and Yo are 
measured for Z measured for Z 


Moe | Moco} Moc Mre | Mreo| Meret 


eee 












































Equality cannot generally hold in these relations since the sample sizes 
must be integers. The symbol ~ is to be read “approximately equal 
to.” The statistic! 

(2) a= 


Ni ni Mir 


3g N un my 





1 In the special case where r =0 the statistic Z has the same form as the double sample estimate 
given by J. Neyman, “Contribution to the Theory of Sampling Human Populations,” Journal of the 
American Statistical Association, Vol. 33 (1938), pp. 101-16. The sample size m;;, however, is a chance 
variable in the present case instead of an available parameter as in the sampling model described by 
Neyman. When r =0 and c =1, the statistic 2 is identical to the regression estimate discussed by Chameli 
Bose, “Note on Sampling in the Method of Double Sampling,” Sankhya, Vol. 6 (1943), pp. 329-30; 
again, however, the sampling model differs from the present case. 
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is an unbiased estimate of M..,/M under the conditions that n;=0 if 
and only if N;=0, m,;=0 if and only if n;;=0. 


THE SAMPLING ERROR 


The restrictions (1) are not sufficient to permit an exact formulation 
of the variance of the statistic Z; a very close approximation can be ob- 
tained, however, by following the customary procedure of treating (1) 
as an equality relation. We then get 


. M—wN M..oM.. N-n M;.0Mi.4 
Va) ~ + p 
n—-m a M 50M ij 
Mnm i,j M3; —1 
M—wN M..oM.. N-n M;.0M;.1 


~ + du 


MN M? MNn ; M; 
> M 50M ij: 
Mnm 33 Mi; 
M-—-WN <@o? N —7n o7(1 — dre”) 
~ MN N n 


nm—m o.7(1 — Ye-zy") 








a- BS 








n m 
where go,” is the Bernoullian variance M..o>M..1/M? and ¢z,’ is K. Pear- 
son’s “mean square contingency.” 

M;.# | M;.oM;. 
(4) pte To ee ~ bi = b>» . - 
uh M;..M..2 M..oM..1. 3 Mj.. 
0S ¢.’ 31 
and ¥z.22 may be called a multiple coefficient of association, 


M... MinMin 9 


p> 


Baffle Be. 





(5) ; 1 





The estimate of variance 
1 E —N MmM..9m..1 N-n M; M;.0M;.1 





V(z) = — 
@) M N m(m — 1) Nn X m m-—i1 


(6) n—-m Mi; ae 


nm ij Mi mz— 1 





is biased only in that (1) is not an equality relation. 
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THE RELATIVE EFFICIENCY OF 2 


If a single sample of k elements were drawn and measured for the 
attribute Z only, the statistic 2,=k..,/k would be an unbiased estimate 
of M.../M and the variance of % would be 


V(&) M—k M..oM..4 M-—k o;" 
Mk M(M—-1). Mk 





Let k be the size of a single sample the total cost of which is equal to 
the cost of the multiple sample described above. Then the multiple 
sample estimate is more efficient than the single sample estimate if 


m 
k< 





, hen : 
N z2 . Ye-xy 


GENERALIZATION 


The extension of this theory to a general number of variables, say gq, 
is now obvious, and is briefly summarized in the general expression 


a a a 


Vors--aa = 1 — p 3 


a re eer Biss ie 








SPECIAL CASES 


This type of sampling scheme applies in a situation where the prop- 
erty Z, is of primary interest and where the cost of measuring an ele- 
ment for the attribute Z, say C,, satisfies 


C.2C,2¢C0,20. 
If C,>C,=C,=0, then 
M—m™ o,(1 — 2-2") 
M m 
If C,>C,=C,>0, then 


M-Net N-m of) —ea% 
V 3) ~———- ° 
(2) M a = 


Via) ~ 








If C,=C,=C,>0, then 


v@ M-—m 
. M 
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If C,=C,y>C.,>0, then 


M-—wWN C,.2 2 o7(1 — dz”) 
M N N m 


V(2) ~ 





APPLICATION 


The motivation for the development of this theory was supplied by 
the Curtis Impact Study, a national survey on magazine readership, 
conducted by the National Analysts, Inc., in November 1948. The 
survey had three primary objectives: (1) to obtain the family charac- 
teristics, interests, possessions, and buying habits of certain weekly 
magazine households, (2) to obtain essential characteristics of the 
readers of the magazines, and (3) to measure the impact of these maga- 
zines upon the readers. 

The sample design for this survey was a multi-stage probability 
sample of 31,298 households based upon the “area” method in which 
the 833 sampling units averaged about 37 households per segment. The 
sampling was done in several stages. Primary sampling units consisting 
of counties or pseudo counties for the open country and cities for the 
urban population were stratified and sampled as follows. 

The United States was divided into three primary strata, urban, 
village, and open country. The urban strata consisted of all incorpo- 
rated places having 2,500 or more people, the village strata consisted 
of all incorporated places having a population of less than 2,500 and all 
unincorporated places having a population of less than 2,500 and a 
population density of 100 or more people per square mile. The open 
country strata consisted of the rest of the population. 

For the open country and village strata, the 3,056 counties were 
grouped into 100 substrata which were homogeneous with respect to 
livelihood, soil and agriculture type. One primary sampling unit was 
drawn from each of the 100 substrata with probability proportionate 

to its 1940 population size. 

‘The urban strata were substratified by size of city and geographic 
location. Eight size classes were made from cities of less than one million 
people. Within each of these size classes the cities were ordered by geo- 
graphic areas averaging about one million people each. This resulted in 
63 urban substrata from each of which one city was drawn with prob- 
ability proportional to its size. Each of the primary units were then 
subsampled using segments as the secondary sampling unit. 

A double sampling technique was employed in the measurement of 
magazine readership within the sample segments. All households in a 
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segment were visited by the enumerator and a responsible adult mem- 
ber was briefly interviewed on the reading habits of each member of his 
household. In particular, this brief interview included the question: 


(1) The names of household members 10 years old and older. 
(II) Does (A) regularly read or look through (B)? 


where the blank (A) was filled in by the names given in (I) and the 
blank (B) by the names of four weekly magazines. A systematic sub- 
sample of households was drawn by selecting every twelfth household, 
beginning at a random starting point. The resulting sample with aver- 
age size of three households was further subdivided into two parts by 
taking a random-systematic sample of every third household in this 
subsample. The resulting split then averaged one and two households, 
respectively. At each of the two households an individual was ran- 
domly selected from the list in (I) and subjected to a long, intensive 
interview and among other things it was determined whether the indi- 
vidual had “read” one half or more of the magazine issues of the pre- 
ceding six months. At the remaining household every individual was 
subjected to this long interview. 

As a result of this sampling, every individual in the sample segment 
was placed in the described reader, nonreader, or don’t-know category 
by some responsible adult member of his household, and a sample of 
the individuals were also placed in the actual reader or nonreader 
category by an intensive interview. With regard to readership of a par- 
ticular magazine, then, the information taken from the segment classi- 
fied a sample of (say) n individuals into a 2X3 table: 


DESCRIBED READERSHIP 


NON- DON’T 
READER KNOW READER TOTAL 





NONREADER Noo N10 N20 
ACTUAL 
READERSHIP READER no nh na 























TOTAL No. Ny. Ne. 





and yields the marginal totals My., M,., Mz., M for the entire segment 
population. The double sample estimate of the segment total number 
of actual readers is then 

— 3 nN; 
M j= + am M ty 
imo 1 
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and the variance of the estimate depends upon the “mean square con- 
tingency” 
M > MiMi 


MoM. izo Mj. 





A sample analysis was carried out to evaluate the efficiency of the 
double sampling technique employed in the Curtis Impact Survey. At- 
tention was centered upon the estimate of total number of actual read- 
ers, and the primary objective was to compare the accuracy of the 
double sample estimate indicated above to the accuracy of an estimate 
calculated directly from the small sample on actual readership. The 
fulfillment of this objective required the estimation of the correlation 
between described and actual readership and the sampling error for the 
single and double sample estimates. Table 3 summarizes the results of 
the analysis on an average segment basis. The between segment com- 
ponent of sampling error was estimated for the urban zone and taking 
this component into account, the relative efficiency was reduced from 
265 per cent to 131 per cent. The efficiency would be reduced still 
further by taking into account the component of sampling error gener- 
ated by the primary sampling unit stage of sampling. The analysis of a 
related character, viz., number of household members, however, indi- 
cated that the between primary sampling unit component of error ac- 
counted for only 15 per cent of the total variation, and it might be 
surmised that this condition prevails also in the sampling of readers. 


APPENDIX 


In the following we shall treat (1) as an equality relation. The proof 
of the statements in the text is then facilitated by the use of condi- 
tional expectations. Note that the random variables appearing in this 
problem are: 

N ;..— observable n;..—observable m;..—observable 

N;;.——non observable n;;.—observable m;;.—observable 

N .j—non observable 1,;,—non observable m,j,—observable 
We prove the following statements from the text: 


I. The estimate 


Ni ny Min 


is an unbiased estimate of M..,/M under the conditions that n;=0 if 
and only if N;=0 and m,;=0 if and only if n;;=0. 
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Proof: 
For fixed samples of size N and n, E(Mia/m,j) =nia/nij, or 
Ni Ninn Ni Nia 
E(z) = —_-——— = —- 
ig N n; i N nm, 


For a fixed sample of size N, E(nja/ni) =Nia/Ni, or 








Nia Nua 
E(z) = = , 
= Lay 
and the average value of N..,/N over all possible samples of size N is 


M../M. 
II. The variance of the statistic Z is 


M—N o*® N—M oa,*(1 — ozs’) 
M N N n 


n—m o(1 — Y2-2y") 


V(z) ~ 














d n m 
Proof: 


(z) = E(z)? — [E(2)}? 


af ae nis mt - [==] 


ba nie Mi; M 


woz =) 
. X Ni Mij 

+o SAY ye ey y meh) 
ibink N N 5 a Mi; » Ne Mk; 


M.. 
mM 


Now, holding the entire sample fixed except for the random variables 
Mijn, We have 


r mis Nj 
E | . 


“ P > nis mis]? _ [“] + > (niz — Miz) Nijo Nis 
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hence, 
af Mempy ae 
are | = N | N?m 


ni 


Now consider the first member of (7), then for fixed n,, 
e| =) = Nia 
N; N; 


N; Ne 2 N..4 3 1 (N; — ni)Ni.oNi. 
” a(= N = he Wd eo n(Ni—1) 





Now consider the first member of (8), then removing all conditions, 
N..3\2 M..a\? M-—-N M...M..4 
a3) , 
N M N(M — 1) M? 
Now consider the second member of (8), then for fixed N;, 
NiAN; — 1) 
E(N;.0Ni.1) = M;.0M i. 
MM; — 1) 
(N; - aed 1 N-n»n N; M;.0M;.1 








> —- 


" - » 2 Bei’ 


N; M; 
(3)-% 
N M 


| 1 N-n N; ved 1 N-n M;.0M;.1 





#[—> 


‘N? i ni(N; - 1) 


and for variable N;,, 


p> xu 


N? oon SM, Mi-1l NM n “& M,-1 
Hence, the unconditional expectation of the first member of (7) is 
Ni 1.1]? M-—-N M..M.. 

ee eM 
i N "% N(M —_ 1) M? 
N-1n M;.oM;. M..;? 
+ x= “+ ——. 
MNn i M;-1 M? 


Now consider the second term in (7), then for fixed n;;, 
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Still holding N ;j, fixed but allowing n;; to vary 


Ny 
E(njj) = hi -—— 
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Now allowing N ;j, to vary but still holding N ;; fixed 
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Now allowing N;; to vary but still holding N; fixed 
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Now allowing N; to vary, the unconditional expectation is 
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and combining (9) and (10) we have 
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M-WN oo? N—n a,(1 — oz2’) 
M N N n 
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III. The bounds on 7.2? are OS 72.2? S1. 

Proof: 

The upper bound is obvious since M ;;,20. 

To prove that the lower bound is zero we must show that 


> MinMin MoM 
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or, more generally, for da, ba20 
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(11) al 
a=1 De a > (ae + ba) 


a=1 


This latter inequality is readily shown to hold for n=2. 
Let 


n 
An = D> Ga, 


a=] 


and assume that (11) holds for n—1, 
then 


= Qa ba < An Baa 
1 Ge + ba > Ani + Ba 
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since (11) holds for n=2, 
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IV. The estimate of variance 
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is biased only in that (1) is not an equality relation. 


Proof: 


Allow only m;,, to vary, then 
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Next, allow n;;, to vary, then 
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Next, allow N ;;, to vary, then 
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ESTIMATING THE PRODUCT OF SEVERAL 
RANDOM VARIABLES 


G. D. SHELLARD 
Metropolitan Life Insurance Company 


HIS problem arises because each of the factors making up the prod- 
cas can ta’xe many different values. The product can naturally take 
many values also. We want facts about this product; which values are 
most likely; how widely they are spread. 

An example may make the problem clearer. A ship crossing the ocean 
runs a risk on each trip. Let p; stand for the probability of safely com- 
pleting the z** trip. This depends on such things as the weather and, in 
war, the amount of enemy activity. These vary; so the particular value 
to be taken by any future p; can never be known in advance. But sup- 
pose that the distribution of p; is known, and that it is the same for all 
values of 7. For our example, let this distribution be given by f(p) 
=kp*(1—p)*, where k=1/B(10, 10/9) =12.99 (B representing the 
Beta function). This distribution is shown in Figure 1 and is all we 
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Figure 1. Distribution of the probability that a ship will 
safely complete one ocean crossing. 
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know about the probability of safely completing one trip. Now let P 
stand for the probability of safely completing ten trips. We know that 
P=rpi, (¢t=1, 2 - - - 10), but what values will P be likely to take? 

If we say that p; takes its mean value on each of ten trips, then 
P=(.9)'°=.3487. If we say it takes its modal value on all ten trips, 
then P = (.9878) !°=.8845. These estimates of central value do not agree 
very well. But how widely may we expect the values of P to be spread? 
The extremes are 0 and 1 of course, but even if we take 1 as the upper 
limit of p; and .8 as a practical lower limit, the range of P is from 
(.8)!°=.1074 to 1. 

Rough attempts to estimate the most likely values of P do not turn 
out well. More refined methods can be used in special cases. When the 
distributions of the p; are all rectangular, the method of Fisher [1], [2] 
can be used to give the exact distribution of P. When there are only 
two variable factors, the method of either [3] or [4] may be used. In 
other cases, the more general but approximate method developed below 
may be used. 

Let 21, 2, +++, 2i, °° * 2, be random variables with mutually inde- 
pendent distributions, and let X = 12. 

We wish to find the probability that X lies between A and B, 
Pr{ ASX SB}. Because of the unique logarithmic correspondence, we 
may write 


Pr {As X<B} =Pr{mnAsinX <InB} (1) 


and 
In X = DO In ay. 
1 


Since each z; is a random variable, In xz; is also a random variable. 
Subject only to conditions which make the central limit theorem ap- 
plicable, In X approaches the normal distribution with increase in n, 
so that 

1 In X=In B 
Pr {In A <InX <InB} “a f e- (In X-1)"/20°G (In X) (2) 


2m J in X=in A 


where uv and o? are respectively the mean and variance of In X. 
In (2), In X is the variable of integration. If the variable of integra- 
tion is changed to X, and if (1) is combined with (2), we obtain 


1 pe dX 
Pr {A $ X s B} > ——= f erin xn) 208 (3) 
A 


/ 210 xX 
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The right hand side of (3) is the expression for the probability that 
X lies between A and B in the logarithmic-normal distribution, and 
shows that X is distributed logarithmic-normally. Expression (3) thus 
gives the distribution of X except that the parameters yz and a have not 
been determined. These can readily be found by equating the first two 
moments of the distribution of X to the corresponding moments of the 
logarithmic-normal distribution. 

If we let u; stand for the mean and o;? for the variance of 2; the first 
two moments of the distribution of X are 


a, = E(X) = E(rz,) = x[E(z,)] = rus) 
a, = E(X*) = E(xz;*) = x[E(z,*) | 


rlad + ot] = (owt a(1+ <2)] 


The corresponding first two moments of the logarithmic-normal dis- 
tribution are 


Pe (4) 





(5) 


Q) = euto?/2 
and \ . 


a = e2tute*) 
Equating (4) and (5), we obtain 


- j 


Whi 
t= In = 


ry/ 1+(2) 4 (6) 





e o\? 
ot = In| rf + (=) \] 

L Mi ) 

Expression (3), with the values of » and o? shown in (6), gives the 
limiting distribution of the product X. The form of this distribution is 
illustrated by the curve in Figure 2. If we wish to find the shortest 
range within which most of the values of X lie, say 90 per cent, we go 
on as follows. On Figure 2 draw MN parallel to the X axis, cutting the 
curve f(X) at the points 7; and i. Drop perpendiculars from these 
points to the X axis. These mark off the range r1—r2 on the X axis and 
enclose under the curve the area rirziot:. This area represents the prob- 
ability that X will lie in the range r:—1r2. Now mark off on the X axis 
any other range, 7:’—r2’, which is equal in length to r:—r2. Perpendicu- 
lars raised from r,’ and rz’ enclose under the curve the area 1y'r2'72't;’. 
This area is surely less than rirzizi1, because the area lost, rir1'4:'t1, has 
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FiaureE 2. Distribution of the probability that a ship will 


safely complete ten ocean crossings. 
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greater height and area than the area gained, rere't2'i2. This shows that 
the way we fixed the points r; and r2 gives the range more likely to con- 
tain values of X than any other range of the same length. 

Values of r; and rz may be found analytically by letting h stand for 
the height of the line MN above the X axis and solving for the points 
of intersection 


1 
X) = ————_ (In X-w)*/20* = i, 
HX) V/2a0X ' 


Taking logarithms and solving for In X, we obtain 
In X = (u — o?) + Vo? — (un + In V2r)ch 


which are the values of In r; and In re. This shows that the values of 
In r; and In rz are spaced equally distant on either side of the point 


(u—o?). 





Now (2) may be written 


In B —- 
Pr (in A sin Xs InB} + 0(=——*) — o( 


o 


and if we set A=r, and B=r, and write 


and 


In A = Inn, = (u — o”) —a, 
In B = In re = (u — o*) +, 


nA-—u 
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we have e 


Pr {(u — 0?) -aSInX S (u — 0’) +a} 


a a 
+2(-0+ +) - o(-«--). (7) 
a o 


To find values of r; and rz which bound a range within which there is ) 
any desired likelihood “1” of X falling, we need only choose “a” so that | 
the right hand side of (7) equals “1.” Then 


Ti en-o*—a 


and . \ (8) 


fe = eve t+ 


To find values of r; and rz which bound the range of given length “r” 
within which there is the greatest likelihood of X falling, we set 


me-n= eu-2"(e8 -—-e*)=r 


and find 





(9) 


. “ ee ‘ 
sinh a = }(e e~*) a 
We now have developed the tools needed to deal with our example. 
What is the distribution of P, the probability of a ship safely complet- 
ing ten ocean crossings? The answer is given by (3) with values of u and 
o* given by (6). 


j gio 


w=In fi (=y7} = — 1.09926, 
1+ (—— 
9 
-0862\*]"° 
om nf + (282) ] "= oo. 


This distribution of P is shown by the curve in Figure 2. 

What is the shortest range of P within which there is a 90 per cent 
chance of P lying? The answer is given by (7) and (8). Since = .3022 
we see that 





#(- .3022 + =) 2 .90, 
. 3022 


(where (x) represents the probability that a unit normal deviate will 
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exceed x) so that, from a table of values of the integrai of the normal 
function, 


(- 3022 + <5) > 1.28, 
.3022 


A few trials show that a/.3022=1.72 is the approximate solution of 
(7), so that r:=.18 and r2=.51. There is thus about a 90 per cent 
chance that the value of P will lie between .18 and .51. 

What is the range of length .30 within which P has the greatest 
chance of lying? The answer is given by (9) and (8). 


; .30 
sinh a = ————— = .493. 


Qe-}- 19058 


From this we find that a=.475 and that r;=.19 and r.2=.49. 

In summary, this paper has developed a method of approximating 
the distribution of the product of a large number of mutually inde- 
pendent random variables by means of the limiting distribution as the 
number of factors increases. 
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SOME PRINCIPLES OF PROCESSING CENSUS 
AND SURVEY DATA 


Rosert B. Vorcut AND MARTIN KRIESBERG 
Bureau of the Census 


I. INTRODUCTION 


ESEARCH on conducting surveys has been marked by a dearth of 

study on processing. This lack is noteworthy since there have been 
rapid strides in processing as well as in such areas as sampling, question 
design, and interviewing techniques. Moreover, the cost of processing 
survey data is frequently a major part of the total survey cost. 

Underlying much of the study on improving survey techniques is a 
desire to obtain increased efficiency, i.e., to get data of acceptable re- 
liability at the lowest possible cost or conversely, to get data of greatest 
reliability with a given budget. Accordingly, the emphasis in sample 
design has been on the efficiency of the sample. Emphasis in question- 
naire design and interviewing techniques has also been on getting the 
most accurate response with the minimum time. Similarly, processing 
is concerned with the most accurate translation of survey data into 
published tables, with a minimum number of man days. In the over-all 
survey research design, savings made in any operation—sampling, in- 
terviewing, processing—may be translated into greater accuracy in 
another portion of the survey. For example, a saving of “X” dollars in 
processing can be translated into “Y” number of additional sample 
segments or interviews within segments thereby reducing the sampling 
error, or into “Z” number of additional questions on the same point, 
thereby reducing possible errors in categorizing responses. 

This paper describes some of the procedures used, principally at 
Census, to increase the efficiency of processing the census and sample 
surveys. While many efficiencies were introduced in individual opera- 
tions, only those which have general application are discussed here. 


II. PROCESSING A SAMPLE OF THE RETURNS 


In many surveys it is, of course, necessary that every return be 
processed. This is the case in the decennial census where every indi- 
vidual in the country must be counted. It is also the case in surveys 
where only a sample of the universe is interviewed. 

In instances where returns are secured from virtually 100 per cent 
of the universe, however, it may frequently suffice to draw a sample 
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from the returns and from that sample estimate characteristics of the 
universe. Such instances occur, for example, where data are originally 
collected from everyone in the population for some purpose such as 
regulation or control, but the study at hand is concerned primarily 
with determining certain characteristics of the universe. 

Examples of processing a sample of returns may be cited from Census 
experience and from the experience of some of the defense agencies. In 
the Bureau of the Census, statistics of family composition and charac- 
teristics are based on samples drawn from the universe of people col- 
lected on 17th Decennial Census schedules. The War Production Board 
(World War II) obtained inventory reports from virtualiy 100 per cent 
of the firms holding the prescribed inventories. For purposes of estimat- 
ing trends in inventories, a sample of firms was drawn from the universe 
of reports and only this sample was processed.' In both instances, the 
estimates that could be made from the sample were accurate enough for 
the purpose. The savings in processing a sample of returns rather than 
all is self-evident. 


III. ESTABLISHING FUNCTIONAL DIVISION OF THE OVER-ALL 
PROCESSING JOB 


The efficiencies of specialization are generally recognized. The ad- 
vantages of specialization are no less real in processing survey data 


than in other mass operations. Some of the advantages as they pertain 
to survey operations may be cited: 

First, the assigned tasks can be learned more readily. The time re- 
quired for training personnel and the time required for them to reach 
optimum levels of productivity is reduced when a job is broken down 
into simple units. This is especially important where the personnel 
available for the job is inexperienced, as is likely where large numbers 
must be brought in to do a job in a relatively short time. Reduced 
training time and reduced learning time are also important where there 
is the possibility of high personnel turnover (e.g., where people are 
hired to do a job on a temporary basis, or when there is a tight labor 
market). With a smaller investment in training and learning, losing 
personnel before they have completed the task is not so costly. 

Second, specialization may improve the quality of the work. Fewer 
errors are likely to be made when the task assigned is not complex, and 
where the operations are done repeatedly. Moreover, clerks may be 





1 For a detailed report on how this was done in one OPA office, see [1]. 
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tested for special aptitudes related to different types of operations and 
such screening may result in better work in each operation. 

Third, specialization can increase the speed of doing the job. When 
operations are repeated again and again, decision time is reduced to a 
minimum. The operator makes the necessary decisions with little re- 
flection. Moreover, operators and first line supervisors are more likely 
to devise small short cuts in an operation which they can grasp in its 
entirety and with which they become thoroughly familiar. 

Specialization does introduce additional problems of control and, 
possibly, excessive handling of schedules. One means of meeting these 
problems is to establish multiple operation units. In the 17th Decennial 
Census, for example, several processing operations were included under 
a single administrative supervisor. While some problems of work back- 
log were thus introduced, the movement of schedules from operation 
to operation took less time, and controls covering this movement 
within the unit were unnecessary. 

In dividing the work of processing, consideration should be given not 
only to such divisions of work as between coders, punchers, and ma- 
chine operators, but also between different facets of the editing and 
coding operations. Thus, for example, it may be well to have one group 
of editors “screen” returns to determine whether or not they are of a 
quality to be processed further or whether they are deficient in certain 
respects and therefore require special attention. If considerable indi- 
vidual editing is required, it is less costly to pull out the faulty returns, 
correct them centrally, and then put them into the regular processing 
channels, than to have them crop up to impair the smooth flow of 
subsequent coding operations. This technique was employed in proc- 
essing 17th Decennial Census data.? An initial screening operation was 
devised to recognize returns in which the enumeration was not of ac- 
ceptable quality and which therefore required either detailed editing or 
re-enumeration. 

At times it is also advisable to divide the coding operation between 
several teams of coders so that the operation may move along in “as- 
sembly-line” fashion. For example, there is a basic difference between 
“rote” codes and “research” codes.* The operation of entering codes 
which require little or no reflection and no reference to code lists, may 
well be separate from an operation where coding is done almost entirely 





2In the 1940 Census, all work went directly to the code clerks and as problem cases were encoun- 
tered they were referred to ad hoc groups of technical assistants. These groups gr~w in number as process- 
ing progressed and segments of work were constantly held up as faulty enumeration was corrected. The 
procedure of 1950 profited by this earlier experience. 

3 For a detailed discussion on types of codes and coding procedures see [2]. 
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by reference to a lengthy manual of code categories. Where persons 
specialize in entering codes on the basis of reference, they acquire a 
proficiency in handling their lists and manuals, and therefore the time 
of such research may be kept at a minimum. This technique was ap- 
plied in the 17th Decennial Census. In coding the basic Population 
Schedule, the major portion of the job was reduced to rote coding. On 
the other hand, entries for occupation and industry had several thou- 
sand code categories, and many of the responses could be coded only by 
referring to a code book of all possible industries and occupations. 
These codes were entered by a separate group of coders who became 
proficient in recognizing descriptions of occupations and industries, and 
in locating their appropriate codes in the manual. 

Where codes may be entered from memory without reference, there 
are advantages in making the coding operation as mechanical as possi- 
ble, assigning the task of handling problems to other operations or 
people. For example, in the 17th Decennial Census, while the enumera- 
tor made entries for an individual on horizontal lines, the basic coding 
job was done by vertical coding. That is, the coder was generally ad- 
vised to code “relationship” for all persons listed on the page, their 
“country of birth” for all persons on the page, and so on, without trying 
to check consistency between entries for a given individual. Any prob- 
lem encountered by the coder was referred to a technical assistant who 
was always available in the operating unit. This made it unnecessary 
for the clerk to acquire much technical competence; instead he could 
concentrate on the basic coding job. 


IV. MECHANIZING THE PROCESSING OPERATION 


As in many industrial operations, one way of reducing over-all costs 
of processing larger surveys is to increase the amount of work done by 
machines, and to reduce the amount of work which must be done by 
hand. Perhaps the greatest strides in processing survey data have been 
made in increasing the proportion of mechanical work. The trend to- 
ward more mechanized processing is the result not only of improved 
equipment, but also of an increasing awareness of the uses to which 
machinery can be put. 

A basic assumption of mechanized processing is that coding by 
punchers is frequently less expensive than manual coding plus punch- 
ing, and that mechanical editing for many types of inconsistencies is less 
expensive than manual editing. Accordingly, an efficient processing 
procedure would make optimum use of manual coders, punchers, and 
tabulating equipment. The emphasis would be on efficiency in the over- 
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all processing job—even at the expense of any one of the operations, 
Some guides to the functional division of processing are given belew. 


(a) Manual Coding 


The manual coder need enter only those codes which could not be 
done more efficiently by the puncher, and he would do only that editing 
which could not be done mechanically. Accordingly, the job of manual 
coding might be confined to the following types of schedule responses: 

First, responses for which the code categories are so numerous or 
complex they require considerable judgment or referrals to complex 
code schemes and manuals.‘ Such responses are inefficient for punch- 
coding since the puncher would need to interrupt his operations too 
frequently. 

Second, responses which, in general, might be punch-coded except for 
the fact that in a small proportion of the cases the entries are so varied 
they require reference to a manual. Although every response to each 
such schedule question might be examined by the code clerk, he need 
enter codes only in those cases where reference to a manual is necessary. 

Third, responses which require inspection of the entries in several 
portions of the questionnaire or more than one schedule for correct 
editing and coding of the individual entry. Such responses conceivably 
could be coded by the puncher but the need for page turning or schedule 
comparing makes the coding inefficient for a puncher. Similarly, where 
the data to be checked for consistency is punched on to more than one 
IBM card, mechanical editing may not be feasible since the tabulating 
machine inspects only one card at a time. 


(b) Punching 


In order to effect substantial savings in coding, the task of coding 
should be combined with punching when possible. The limitations to 
punch-coding are, of course, set by the questionnaire format and by the 
kinds of data collected. Thus, for example, open-ended questions where 
respondents are asked to give their opinions on a subject, do not lend 
themselves to punch-coding. On the other hand, questionnaires in 
which the code categories are already listed and numbered, may readily 
be coded by the card puncher. In the latter instance, the puncher can 
punch-code as rapidly as if the question had been coded manually first. 





4 Coding responses to open-ended attitude questions illustrate this; each response must be care- 
fully weighed in terms of the categories established and therefore the manual coder can probably do the 
work as rapidly as the puncher. 
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Several experiments conducted as a pretest to the census processing, 
gave ample evidence that punch-coding within the limits indicated, was 
both accurate and efficient. 

Other questions that do not lend themselves completely to punch- 
coding may be handled by a combination of manual and punch-coding. 
Thus, for example, a question for which many possible responses were 
pre-coded, but which also permitted write-in-answers other than one of 
the numbered categories, could be handled by a combination of punch- 
coding and manual coding. The manual coder would need to code only 
the write-in-answers (usually a small proportion of the cases). The 
puncher could then punch all pre-coded responses and, in the case of 
write-ins, the codes entered manually. 

The job of card punching may itself be reduced by “X-skip bar” ar- 
rangements. This is a mechanical device by which the puncher is not 
required to punch “inapplicable” codes for a portion of the punch-card, 
but instead may punch an X at the beginning of the inapplicable field 
and the bar will automatically cause the machine to skip the field so 
the operator may begin punching in the next relevant field.5 Thus, for 
example, in a punch-card used for persons 14 years of age and above, 
as well as for children of less than 14, certain characteristics such as 
earnings, veteran status, marital status, etc., may not be relevant for 
those under 14. With the X-skip bar arrangement, it is not necessary 
for the operator to punch a code for these persons indicating the inap- 
plicability of the field; he may simply punch X at the beginning of the 
several fields, skip the required number of card columns, and then be- 
gin punching for the next relevant field. 


(c) Mechanical editing while tabulating 


Special mechanical edits can be prepared which will single out cards 
which contain errors. Then by examining these cards, errors in them can 
be corrected.* In a small scale survey such an operation may be well 
worth while. However, on a project as large as a Decennial Census, 
where a tenth of one per cent of the cards amounts to 150,000 cards, 





5 In order to accomplish skipping of a portion of the punch-card, a small metal bar is made up in 
accordance with the skip pattern desired. This bar is fitted into the punch-machine and operates when 
the “X” key is punched. The manufacturers of tabulating equipment, have several pamphlets detailing 
this and other aspects of their several machines. Moreover, their representatives are frequently able to 
make on-the-spot suggestions on how to accomplish specific jobs most efficiently. 

® Several kinds of errors may be found in the punched cards, e.g., codes which the operator failed to 
punch, codes which are outside the established categories, codes which are within established categories 
but obviously inconsistent with other codes. The last type of error can often be corrected by examining 
the card manually and then having it repunched to make the faulty code consistent. with other relevant 
codes; in some cases the other types of errors may be similarly corrected. 





228 AMERICAN STATISTICAL ASSOCIATION JOURNAL, JUNE 1952 


and where the processing work is divided between many groups of 
people, the cost per card of matching these rejected cards with the 
original schedules is increased, and the volume of work is so great that 
the cost of correction becomes quite large. 

Tests conducted at the Bureau of the Census indicate that the pro- 
portion of error cards selected for correction is so small that the expense 
of such an operation is unwarranted for most items. Also, despite such 
mechanical edits, the so-called “perfect deck” still contains imperfec- 
tions because (1) some errors inevitably slip through the mechanical 
edits, and (2) errors are inserted into the cards after the edit, due to 
carelessness on the part of tabulating personnel in later operations. 
Accordingly, it is still necessary to incorporate consistency checks in 
the tabulations to eliminate such errors in the final results. 

It is more efficient and less costly mechanically to examine and allo- 
cate error cards to acceptable categories as a part of the regular tabulat- 
ing processes.’ Thus making use of the editing features of the tabulating 
machines concurrently with the processing of tabulations, such residual 
errors or inconsistencies as may appear in the cards can be eliminated 
from the tabulations without removing and correcting the cards. 


V. ESTABLISHING QUALITY CONTROL PROCEDURES 


In any large processing operation on which many clerks are em- 
ployed, errors will be made. Despite verification and re-verification, 
and despite the most careful controls at each step of the operation, 
many of these errors will be carried forward to the end of the job and 
will be found in the machine sheets on which the data are listed. A 
disproportionate amount of the processing budget can be spent in try- 
ing to remove all errors; it is therefore well to recognize that a residual 
error will persist. If published tables are to be in perfect balance, not 
only internally but with those containing similar data, it will be neces- 
sary at the end to make some arbitrary adjustments to bring them into 
agreement. Accordingly, it seems reasonable to make every effort to 
keep errors at a minimum, but not to spend an excessive amount of 
time or money in attempting to ferret out and correct every last linger- 
ing error in the data. Rather it seems appropriate to deal with residual 





7 Such allocations of errors are usually done on the basis of prescribed rules of consistency between 
different items on the card. For example, the tabulating machine may be so wired that when a card for 
an individual contains the relationship code for “husband” and on the same card the code for age is “9” 
and the code for sex is “female,” the card will automatically be tabulated as “other relative” rather 
than “husband.” The code “other relative” is an “acceptable” category for relationship since it is con- 
sistent with the codes for age and sex of this person. Such mechanical edits and allocations are accom- 
plished by ad hoc wiring of the tabulating machine board. 

The writers wish to acknowledge the work of Morton Boisen in this phase of the Census processing. 
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errors in a statistically acceptable fashion which will minimize the 
likelihood that statistics for any given characteristics or geographic 
area Will be biased as a result of such errors. A processing budget should 
be designed not so much to catch and correct all errors of enumeration, 
coding, punching, or tabulating, but to assure that any consistent 
errors, Which might affect published data, will be removed. 

One of the ways in which consistent errors in an operation can be 
caught with a minimum amount of time, is to devise a screening opera- 
tion. A relatively quick perusal of a schedule can spot major omissions 
or inconsistencies between critical items. Moreover, the work of a par- 
ticular interviewer or a particular group of interviewers working out of 
one office is likely to show a disproportionate number of errors. This 
clustering of errors makes it feasible to screen only a sample of the work 
done by an individual enumerator, or by one field office. It may be 
assumed that the work of most interviewers will be acceptable ; screen- 
ing is necessary to catch the occasional case where poor work was done. 
The same tendency for errors to cluster—as a result of work by poorer 
operators—is likely to be repeated in other operations. Accordingly, 
similar procedures of screening may be applicable in several processing 
tasks. 

If processing is to be carried on within acceptable standards of qual- 
ity, some verification procedure is required. At the beginning of an 
operation it may be necessary to install verification of all work done in 
the processing. New personnel are more likely to make many errors and 
100 per cent verification is therefore necessary if the error rate is to be 
reduced to acceptable levels. After an operator is performing at ac- 
ceptable quality levels, however, the purpose of verification should be 
to assure the continuance of processing within the quality standards. 
Sample verification may be installed which will have the dual function 
of catching poor runs of work and clerks who occasionally grow careless. 
While verifiers should correct errors they find, sample verification 
should be viewed as a procedure for measuring the quality of the opera- 
tion, rather than a means of improving it.® 

In a large operation in which many verifiers are employed, there 
must be some means of checking the quality of the work done by 
verifiers. Typically, this is done by spot re-verification—that is, the 
supervisors may periodically review the work done by verifiers, or a 
smaller group of verifiers may be assigned the specific task of verifica- 
tion. Another procedure that may be employed is that of recording or 





8 We wish to acknowledge the work of Joseph Steinberg and of William Hurwitz in this phase of 
the Census processing. For a detailed account of this work in the 1940 Census, see [3]. 
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“planting” errors before work goes on to verifiers. In this method, errors 
which are found in a coder’s work are recorded and the verifier’s report 
of errors that he found is later compared with the record of errors which 
are known to exist. In cases where very few errors are made, and where 
feasible, errors may be “planted.” Error recording or planting is more 
efficient than other means of re-verification and is particularly valuable 
where error rates of the original work are low. 


VI. ESTABLISHING STANDARDS OF PRODUCTION 


The value of establishing standards of production for plant opera- 
tions has been recognized by indusirial engineers for some time. Pro- 
duction standards for survey processing operations are no less valuable. 
At the risk of stating the obvious, some of the reasons why production 
standards are important in survey processing should be noted: 

First, an announced standard of production gives operators a goal 
toward which to work. Most workers want to know what is expected of 
them and are anxious to live up to production requirements. In the 
absence of standards, the operator works at his own estimate of what 
represents “an honest day’s work” ; this may or may not be up to stand- 
ards acceptable to management. 

Second, the establishment of production standards makes it possible 
to set production levels for administrative recognition and reward, and 
for administrative disciplinary action. Thus operators achieving pro- 
duction rates considerably above the average, may be given recognition 
in the form of honor roll listing, performance ratings, bonuses, or . 
promotions in proportion to the amount of acceptable work produced. 
Operators performing considerably less than the standard may be sub- 
ject to reassignment, termination, or other disciplinary action. Using 
administrative rewards and sanctions to increase productivity is likely 
to be ineffective, if not impossible, unless standards are fairly set. 

Third, there are two by-products of production standards which are 
also important: (a) The need for delineating the different steps of an 
operation, and of obtaining accurate time measurements of those 
steps—in order to establish production standards—facilitates budget 
estimating for similar operations. When the component parts of an 
operation have been accurately measured, and the time costs deter- 
mined, the budget for such operations becomes a matter of adding the 
time cost of each component. For example, if a transcription operation 
includes such steps as assignment procuring and return, sample selec- 
tion, posting, and verification, and if the cost of each of these steps is 
determined, subsequent processing jobs which include similar steps 





SOME PRINCIPLES OF PROCESSING CENSUS AND SURVEY DATA 231 


may more accurately be estimated. (b) The need to take time measures 
of an operation, and to observe the flow of work objectively, frequently 
leads to improvements in the procedure. Weaknesses in established 
procedures, such as unnecessary handling of papers, can frequently be 
uncovered by such observations. Moreover, the need to commit man- 
agement to a standard of production forces careful appraisal of the 
efficiency of the operation before production standards are announced. 


VII, CONCLUSION 


These principles have been formulated on the basis of procedures put 
into operation at the Bureau of the Census and in other survey research 
organizations. They represent the contributions of many individuals 
and groups, in the Census and elsewhere, only a few of whom could be 
mentioned in the footnotes. As indicated, some of the assertions are 
supported by evidence collected in ad hoc experiments; others are based 
on differences in performance when procedural changes were introduced 
in going operations. The principles are offered not as immutable laws 
for survey researchers, but as suggestions which may make for more 
self-conscious application of processing procedures. 

There is no single best way of processing all surveys. Just as sample 
designs and questionnaires are pre-tested before a survey goes into the 
field, so processing procedures should be pre-tested before scores of 


people are assigned to the job. Only by recognizing and testing alterna- 
tive procedures and by critically evaluating their efficiency can further 
advances be made in this area of survey research. 
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SHORT-CUT METHODS OF ESTIMATING 
COUNTY POPULATION 
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HIs study tests the accuracy of selected short-cut methods of esti- 
mating the postcensal population of counties. Twenty series of 1950 
estimates based on various kinds of symptomatic data and techniques 
were computed for the thirty-nine counties of Washington state. These 
estimates were then compared with preliminary 1950 U. 8S. Census data 
to gauge the relative accuracy of each method. 
Two series were computed by conventional mathematical methods, 
twelve were based on simple symptomatic data, and six were developed 
from weighted combinations of the twelve simple symptomatic series, 


ARITHMETIC PROJECTION AND APPORTIONMENT 


The two conventional mathematical techniques tested were arith- 
metic projection and apportionment. 

Arithmetic projection assumes that yearly absolute growth in the 
postcensal period equals the average yearly increase in the most recent 
intercensal period for the area in question. State health departments 
frequently use this method for small area estimates, and several recent 
textbooks recommend it.! 

Apportionment was the second method to be tested. “From 1926 
until 1934 the Bureau of the Census made its estimates for state and 
city population by the apportionment formula. This formula assumes 
that the estimated postcensal increase in the national population was 
divided among these states or cities according to their shares in the na- 
tional increase of the previous intercensal decade.”? For the present 
paper, the method was refined by use of state (rather than national) 
increase data. In practice, this would mean using the yearly state popu- 
lation estimates issued from time to time by the U. S. Bureau of the 
Census. In the present study, however, actual preliminary counts of 
the 1950 Census were used. 

Neither method proved to be very accurate. Estimates based on 





* The assistance of Albert H. Crosetti is gratefully acknowledged. 

1 See, for example, the Washington State Health Department, “Population of the State of Washing- 
ton by County, City and Rural Areas, July 1, 1951,” Seattle, October 1, 1951 (mimeographed); Ruth 
Rice Puffer, Practical Statistics in Health and Medical Work (New York: McGraw-Hill Book Co., Inc., 
1950), p. 78; Harry 8S. Mustard, An Introduction to Public Health, 2nd Edition (New York: The Mac- 
Millan Co., 1947), p. 21. 

2 Henry 8S, Shryock, Jr., “Methods of Estimating Postcensal Populations,” American Journal of 
Public Health, Vol. 28 (1938), p. 1042. 
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arithmetic projection had a mean deviation (signs disregarded) of 16.3 
per cent from the preliminary census counts. Estimates made by the 
apportionment method had an average “error” of 28.9 per cent.? Fur- 
ther information is given in Table 1. 


TABLE 1 


PERCENTAGE DEVIATION (SIGNS DISREGARDED) USING 
ARITHMETIC PROJECTION AND APPORTIONMENT 








Method Mean C.R.* 





Arithmetic projection 
Apportionment 





* Between given mean and mean (4.8 per cent) for most accurate series tested (see Table 3). 


SIMPLE SYMPTOMATIC SERIES 


Estimates using simple symptomatic data were based on two differ- 
ent methods and six kinds of data. 


Kinds of symptomatic data 


The following types of data were used: live births, deaths, public 
school enrollment, registered passenger automobiles, registered voters, 
and county welfare recipients. These indices were chosen because they, 
more than others currently available, appeared to satisfy the following 
requirements: 

1. Accurate official records are kept of each and are summarized at 

least biennially in published reports. 

2. Comparable data are available for both the postcensal period and 
the time of the most recent census (important for only the second 
method described below). 

. Each kind of data comprises a large number of cases, or a rela- 
tively large percentage of the population of which it is sympto- 
matic. 

4. The ratio of symptomatic data to total population is relatively 
constant from county to county and year to year. 

Certain characteristics of the six types of data are shown in Table 2. 

With regard to the third criterion listed above, it would appear that 
voter registration is the best index of population; there were only 2.02 





3 Based on the U. 8S. Bureau of the Census, 1950 Census of Population, Preliminary Counts, Series 
PC-3, No. 4, p. 14. The thirty-nine counties had an average population of 60,597, with a range from 
3,191 to 726,464. 





234 AMERICAN STATISTICAL ASSOCIATION JOURNAL, JUNE 1952 


persons per registered voter in Washington state in 1950. The highest 
ratio was that of deaths to total population (108.45). Inter-county dif- 
ferences, as measured by the coefficient of variation, were lowest in 
1950 for population per registered voter (C. V.=11.9) and highest for 
population per welfare recipient (C. V.=54.9). The correlation coefi- 
cient between 1940 and 1950 county ratios of population to sympto- 
matic data ranged from +.36 for live births to +.85 for public school 
enrollment. 


TABLE 2 
RATIOS OF POPULATION TO SYMPTOMATIC DATA 








Ratiec (for entire Coefficient of variation Correlation (r) 
state) (39 counties) between 1940 

and 1950, 

1950 1940 1950 by county 








Live births ¢ 43.10 ‘ J +.36 
Deaths . 108.45 ‘ ° +.51 
School enrollment ‘ 5.76 ; +.85 
Auto registration J 3.22 i +.76 
Voter registration Z 2.02 i P +.38 
Welfare recipients . 14.52 5 54. +.64 





Proration method 


The third technique tested, hereafter referred to as the proration 
method, consisted of a simple allocation of the postcensal (1950) state 
population on the basis of each type of symptomatic data listed in the 
preceding paragraphs. It was based on the assumption—obviously not 
entirely true—that the population-to-symptomatic data ratio was equal 
for all counties. Its chief advantage lay in its simplicity and inde- 
pendence of decennial census data. 

The proration method, like apportionment, depends on an accurate 
postcensal estimate at the state level. For most practical purposes, the 
annual state estimates reported by the U. S. Bureau of the Census 
should be satisfactory. Actual preliminary census counts were used in 
the present paper, but this modification would not appreciably affect 
relative accuracy. 

Accuracy of the proration method varied greatly, depending on the 
symptomatic data used. The best series was that based on voter regis- 
tration, with an average “error” of 8.7 per cent. Series computed from 
automobile registrations, births, deaths, and public school enroliment 
had mean deviations, signs disregarded, ranging from 11.9 to 15.8 per 
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cent.4 Welfare recipients proved to be the most inaccurate basis, with 
a mean deviation of 25.2 per cent. 
Additional information is given in Table 3. 


TABLE 3 


PERCENTAGE DEVIATION (SIGNS DISREGARDED) USING 
SYMPTOMATIC DATA 








Proration method Censai ratio method 





Basis of estimates 
Mean o C.R.* Mean o C.R.* 





Single symptomatic series 
Live births 
Deaths 
School enroliment 
Auto registration 
Registered voters 
Welfare recipients 


.84 
-12 
-63 
-1l 
-35 


— te 
ameraa aw 
NN CONN 


to 


Weighted combinations 
Equal weighting (6 series) 
Weighted by ratio to population (3 seriest) 2 
Weighted by 1/C.V. (3 seriest) “ J -11 





* Between given mean and mean (4.8 per cent) for censal ratio combination weighted by 1/C.V. 
t See text for lists of various series used. 


Censal ratio method 


The fourth technique tested, hereafter referred to as the censal ratio 
method, involved the following steps: 

1. Computation of censal (1940) ratios of population to symptomatic 

data for each county. 

2. Application of censal (1940) ratios to postcensal (1950) sympto- 
matic data to obtain tentative postcensal county population esti- 
mates. 

. Adjustment of tentative county estimates to make them total an 
independent statewide estimate (in this case, the actual enumer- 
ated state population). 

The importance of this final adjustment should be emphasized. 
Birth, death, and automobile registration rates changed considerably 
between 1940 and 1950; use of the tentative (unadjusted) estimates 
taken directly from the 1940 ratios would have produced extremely in- 





4 See Donald J. Bogue, “A Technique for Making Extensive Population Estimates,” Journal of the 
American Statistical Association, Vol. 45 (1950), pp. 149-63, and Carl Frisen, “Symptomatic Data in 
Population Estimates: Problems and Limitations,” Research Studies of the State College of Washington 
(“Proceedings of the Pacific Sociological Society”), Vol. 19 (1951), pp. 107-10, for further evaluation of 
birth and death rates as a basis for population estimates. 
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accurate estimates. The adjustment assumes that the symptomatic 
ratio changes in the same degree, percentagewise, for each county. 

Average deviations range from 7.0 to 33.2 per cent (see Table 3). The 
lower figure was achieved by both the enrollment-based series and the 
one derived from voter registrations. Registered automobiles, live 
births, and deaths had somewhat less value as symptomatic data, with 
mean “errors” of 11.1 to 16.3 per cent. The estimates based on welfare 
recipients differed most from the census counts. 


WEIGHTED SYMPTOMATIC SERIES 


The six remaining series of estimates were made using various combi 
nations of the estimates based on simple symptomatic data. 

It was assumed that an underestimate based on one type of sympto- 
matic data would tend to cancel out an overestimate based on another 
kind. This would be true to some extent if the directions (signs) of 
“error” among the various series had low intercorrelations, and, to a 
much greater extent, if the directions of “error” had high negative inter- 
correlations. The former was more generally correct; among the six 
series based on the censal ratio method, for example, intercorrelations 
of the direction of deviation fell, with two exceptions, between —.44 
and +.37.5 


Equal weighting 


The first weighted series to be tested consisted of a simple average of 
the preceding county estimates. Each estimate was thus assigned equal 
weight. Separate averages were computed for the six series based on the 
proration method and those made with the censa! ratio method. 

In neither instance was the average deviation unreasonably large 
(see Table 3). Mean “error” for the combined proration method esti- 
mates was 8.2 per cent. Combining estimates made by the censal ratio 
method resulted in an average deviation of 8.0 per cent. 


Weighting by the ratio to population 


The second weighting technique assumed that accuracy was reiated 
to the ratio of symptomatic data to population. (Actually, the coeffi- 
cient of correlation between the reciprocal of this ratio and mean devia- 
tion, signs disregarded, was +.04 for the six proration series and +.21 
for the six series based on the censal ratio method.) Estimates based on 





5 The extreme values were —.81 (series based on live births and automobile registration) and +.93 
(school enrollment and welfare recipients). 
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the three kinds of data with the highest 1940 statewide symptomatic 
ratios (voter registration, automobile registration, and public school 
enrollment) were accordingly combined, using individual weights based 
on these statewide ratios. 

Results were relatively good (see Table 3). For the weighted combi- 
nation of estimates made by proration, average “error” was 8.4 per 
cent. The combination of estimates made by the censal ratio method 
had a mean-deviation of 6.0 per cent, second best of the twenty series 
tested. 


Weighting by 1/C.V. 


The third weighting system assumed that accuracy was associated 
with geographical uniformity of the symptomatic ratios. (In reality, 
the correlation between the postcensal mean deviation, signs disre- 
garded, and the 1940 coefficient of variation was +.92 for the six series 
based on proration and +.97 for the six made by the censal ratio 
method.) Estimates based on the symptomatic ratios with the lowest 
1940 C.V.’s (voter registration, school enrollment, and live births) were 
therefore combined, using individual weights based on the reciprocals 
of these coefficients (see Table 3). 

This weighting method produced the lowest average “error” of the 
twenty tested (see Table 3). The combination of censal ratio method 
estimates had a mean deviation from census figures of only 4.8 per cent, 
and thus would appear to be the method of choice. The combined pro- 
ration method estimates had a somewhat higher mean deviation (7.1 
per cent), but were still satisfactory for most purposes. 

The mean “error” found for the preferred method was significantly 
lower than the means found for most of the alternative series. In only 
three instances were the average “errors” not significantly different at 
the 5 per cent level: the combination of estimates based on the censal 
ratio method and weighted by the symptomatic ratio to population 
(C.R. =1.24), the censal ratio method estimates based on registered 
voters (C.R.= 1.58), and the censal ratio method estimates based on 
school enrollment (C.R.=1.86). Critical ratios are given in detail in 
Tables 1 and 3. 


IMPLICATIONS 


It must be remembered that the present study was confined to data 
for the thirty-nine counties of the state of Washington. Because of dif- 
ferent economic conditions, different registration systems, or other in- 
terstate differences, the findings summarized above may have relatively 
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little relevance to population analysis in other areas. Conventional 
techniques such as arithmetic projection or apportionment broke down 
completely in Washington because of wartime growth in the Puget 
Sound, Grand Coulee, and Hanford areas; other states, more stable 
than this one, may present a less extreme situation. 

It should also be remembered that this paper makes no effort to test 
the longer methods of postcensal estimation. Such time-consuming 
techniques as the “migration and natural increase” method recom- 
mended by the U. 8. Bureau of the Census® may well warrant the 
greater amount of calculation they require. Shryock and Lawrence re- 
ported a mean deviation of 8.4 per cent for the preferred form of this 
method when applied to 1943-47 data for 102 counties.’ 

Even so, two definite conclusions emerge from the present study: 

First, it appears that even the simplest use of symptomatic data re- 
sults in more accurate estimates than older short-cut methods make 
possible. Arithmetic projection, for example, produced a mean devia- 
tion of 16.3 per cent for the thirty-nine Washington counties, compared 
to only 7.0 per cent for the censal ratio estimate based on public school 
enrollment. 

Second, it is evident that further improvement can be achieved by 
combining several estimates, properly weighted. The best such series 
tested consisted of an average of censal ratio method estimates weighted 


by the reciprocals of their 1940 coefficients of variation. Average devia- 
tion by this method was only 4.8 per cent. 





* U.S. Bureau of the Cenus, “Suggested Procedures for Estimating the Current Population of 
Counties,” by Hope T. Eldridge, Population—Special Reports, Series P-47, No. 4. 

7 Henry 8. Shryock, Jr., and Norman Lawrence, “The Current Status of State and Local Population 
Estimates in the Census Bureau,” Journal of the American Statistical Association, Vol. 44 (1949), pp. 
157-73. 





SAMPLING SURVEYS IN CENTRAL AFRICA 


J. R. H. SHavut 
Central African Statistical Office 


This paper discusses the importance of the development of 
modern sampling theory in Central Africa. A description of 
the application of sampling methods in Southern Rhodesia 
to the problem of obtaining demographic statistics of the 
African population is given. Special attention is drawn to the 
problems attaching to the application of sampling theory in 
Africa. A table of African vital statistics is given for Southern 
Rhodesia; and the various fields in which sampling methods 
have been tried in Central Africa are described. 


HE revolutionary advances in sampling technique which have taken 

place in the last 25 years have provided statisticians with powerful 
means of measuring many of the most important quantitative charac- 
teristics of primitive peoples in Africa. 

Since the dawn of history the rates of growth of the numbers of 
persons in each primitive race and in extensive regions of Africa, vital 
conditions, area and output of the different crops, extent of malnutri- 
tion, degree of pressure of population on food resources among African 
races have all been imperfectly known [2, 11]. Modern statistical tech- 
nique provides a means whereby all these measurable characteristics 
can be reduced to quantitative terms and will enable one of the most 
important gaps in international sociological knowledge to be bridged. 

The older methods of enumeration by a complete coverage have 
hitherto largely failed to measure the population characteristics of 
African peoples because: 

(a) The African cannot complete Census schedules and successful 
enumeration demands the intensive work of field officers trained 
to secure accurate data from the individual. 

(b) Areas to be covered are of vast extent and the administrative 
machinery is very thinly extended to cover the whole country. 

(c) The resources in finance and personnel are too sicnder to deal 
with the vast areas and the illiterate, sometimes hostile and 
superstitious population. 

Such enumerations as have been taken in the past have in many 
African countries frequently proved to be inaccurate and unreliable for 
practical guidance and in Central Africa at least no successful determi- 
nation of the vital statistics of African races has hitherto been made. 
The pressing local and world demands for African vital and food supply 
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statistics and the impossibility of organizing compulsory registration 
of births and deaths can therefore be met only by sampling. 


PRELIMINARY PHASES AND PLANNING OF THE SOUTHERN RHODESIA 
DEMOGRAPHIC SURVEY 


In Central Africa the sample technique was first employed in South- 
ern Rhodesia by the Central African Statistical Office. 

The type of sample was systematic single stage sampling by Native 
administrative districts of which there are 36 in the country. A system 
of inter-penetrating samples was employed in each Native district and 
this design enabled the work of the field officers to be checked and also 
furnished a sample in those few districts where through unforeseen cir- 
cumstances (illness, etc.) one of the field officers could not operate. 

The Survey sought to determine: 

1. The total indigenous population of each district within a confi- 

dence belt of 10 per cent at the 5 per cent level of significance. 

. The de facto indigenous population of each Native district. 

. The distribution of population by sex and the age groups (a) under 
one year; (b) one year to puberty; (c) over puberty. 

. Total births in the past year. 

. Total deaths in the past year. 

. Deaths of infants below one year of age in the past year. 

. Total number of children born to each woman. 

. Number of children born to each woman and surviving. 

. Total indigenous population of Southern Rhodesia, distinguishing 
de facto population in Native areas, population living out of the 
Native areas and population outside of the country. 

The sampling errors of groups of population bigger than Native dis- 
tricts and of classifications of the population by ages, etc., of Native 
districts and larger groups of these subdivisions were allowed to find 
their own level. 

In organising the sample, the sample unit was the African village. 
This unit was adopted because the most reliable sample frame was the 
list of village headmen kept for the tax registers. The systematic sample 
was obtained by selecting every nth village in the Native district regis- 
ter, the first village on the list being selected at random. In questioning 
adult women present in the village the numbers interviewed exceeded 
.66 and frequently exceeded .90 of the women domiciled in the village. 

The determination of the size of the sample required to yield the 
confidence belt selected for the Native district was obtained from 
Aroian’s formula applied to each district [1, 12]. The standard deviation 
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of the village population in each district had to be estimated and on the 
basis of a pilot Survey in one district a final coefficient of variation of 
roughly .7 was employed. Rough figures of the average size of each 
village were obtained by existing estimates made by the Department of 
Native Affairs. In actual practice the confidence belt was worked on 
the basis of absolute figures rather than a percentage basis, the absolute 
figures being 10 per cent of the existing estimates of the population of 
the district. The total number of villages included in the sample when 
the numbers in each district were added came to 2,561 out of 23,270 
villages on the tax register of the Colony. 


ADMINISTRATION OF SURVEY 


The theoretical basis of the Survey was the formula of Aroian which 
specified the size of sample that is needed to obtain a desired degree of 
precision, and plans based on this formula were submitted to the United 
Nations Sub-Commission on Sampling in 1947 [13, 15]. A pilot survey 
based on these plans was then organised and on the basis of this pilot 
survey plans were laid for a nation-wide sample survey in 1948. 

The work was carried out by 25 trained field officers. All field officers 
had to attend a course of training at Head Office and were taken into 
Native districts in the neighbourhood and given practical field work. 
On the basis of the experience gained from the pilot survey, and the 
training of the field officers and discussions of common problems, a set 
of instructions and forms to be used in the field was drawn up by the 
Central Office and issued to all field officers. The direct cost to the 
Statistical Office totalled £5,600 distributed as follows: 


Pay and allowances to field staff £2 ,400 
Transport charges 2,400 
Camping equipment and medical stores 200 
Analysis clerks, etc. 600 


Total £5,600 


This cost represented £219 per village. 

Messengers were sent in advance to each of the selected villages and 
the headman was notified of the approximate date and time of visit of 
the field officer. The field officers were accompanied by Native messen- 
gers who were aware of the objects of the survey and carefully in- 
structed in the best way of enlisting the co-operation of the headman 
of the village. Since the Native interpreter usually put the questions to 
the inhabitants of the village, the success of the inquiry largely de- 
pended upon his skill and great care was taken in teaching him to win 
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the co-operation of the African and the best way to frame his questions, 
On arrival at the village the field officer interviewed the headman and 
asked him to accompany him through the village. He also ascertained if 
he had been warned and explained to him the nature of the inquiry, 
The huts were counted and pointed out by the headman to ensure that 
none was overlooked. The questioning was done hut by hut and the 
inhabitants of each hut were assembled at the door. Field officers were 
trained to compare the consistency of the answers to questions ob- 
tained from headmen and members of a hut. Normal residents in the 
village and visitors (i.e., persons who had their evening meal the previ- 
ous evening in the hut) were all questioned. Particulars relating to 
absentees were obtained from the head of the family living in the hut. 

The following general procedure was followed in questioning heads 
of huts. 

1. First of all, the head of the hut was obtained and fully questioned. 

2. The same was done for each wife in turn. 

3. When the wife was present, she was asked the questions relating 

to fertility. 

4. The particulars relating to her children were also obtained. 

. If she had any daughters over puberty living with her, particulars 
of their children were obtained. 

. The particulars of other males and females living in the hut were 
recorded. 

. The head of the hut and other members were asked to give par- 
ticulars of deaths in the past year. 

. The people associated with the hut are then counted up and this 
total compared with the number the headman of the village and 
the head of the hut agree as belonging to the hut in order to see if 
any individual is missing. 

The information obtained verbally by the field officer was entered 
into specially designed books in which checks were provided by cross 
totals [4]. Particulars of the attributes of the absentees were obtained 
from the head of the hut but not of the fertility of absent mothers. This 
distinction was made because in the former case estimated totals were 
required, but the fertility questions were required in order to obtain 
ratios such as birth rates, etc. Such ratios would be biased only if there 
was reason to believe that the absentee mothers had a fertility different 
from mothers questioned, whereas failure to record absentee persons 
would necessarily introduce an underestimate of the total African popu- 
lation of the district. 
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In planning the sample nothing was known of the range of size of 
villages and no method of stratification by size of village was possible. 
A rough estimate of the average number of persons per village was ob- 
tained from the estimated population of the district divided by the 
known number of villages. In the absence of stratification systematic 
sampling was practised in order to obtain the advantages of rough geo- 
graphical and tribal stratification in the village lists and also because 
of the tendency for sampling errors in systematic sampling to be lower 
than those of random samples. Within any tribal or geographical strata 
names of villages probably occurred in fortuitous order. 

In every district at least two independent inter-penetrating samples 
were selected. For instance, if it were intended to obtain a total sample 
of 1 in 10 of the villages two independent samples of 1 in 20 covering the 
whole district were obtained. The work of each field officer was so 
organised that it was tested at least three times against entirely differ- 
ent officers working an independent sample in the same area. The test 
applied to the independent results was Fisher’s formula to determine 
whether the results were consistent with those of two independent 
samples from the same universe [10]. A difference of twice the sample 
error between the results was regarded as significant. In such a case it 
was decided to watch the results of both officers in other areas to see 
whether significant differences invariably occurred with either of the 


officers and in this way it proved possible to determine which field 
officer’s results should be disregarded. 


ADMINISTRATIVE PROBLEMS 


The principal difficulties confronting the statistician in Central Africa 
are the vast distances to be covered—usually unmapped—and some- 
times unexplored ; the mobility of the population; the prevailing ignor- 
ance, superstition, and suspicion of the African; and the growing break- 
down of tribal life through contact with the European, resulting in the 
absence of large numbers of Africans from registered villages. Securing 
sufficient motor transport and petrol to maintain 25 field officers for a 
period of two months presented the Central Office with many a head- 
ache. The problems of maintaining contact with the field officers scat- 
tered all over an area of 150,000 square miles, of seeing that petrol was 
at the right place at the right time and providing relief for transport 
break-downs were equally formidable. As regards the mobility of the 
African, field officers would arrive at sites to find the village had been 
transferred elsewhere. 
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The African is not accustomed to using a very large range of numbers 
and the determination of age groups had, therefore, to be made at 
fundamental changes in his life. In practice, the determination of these 
age divisions was not so difficult as was originally anticipated and wag 
facilitated by the preparation of a calendar giving the seasons of the 
year in Native terms. Much time had to be taken by the field officer in 
winning the confidence of the African. In particular, he had to be con- 
vinced there was no element of witchcraft in the background of the 
scheme and that it was not some Government device for exercising 
further unwelcome control of his freedom. A special technique of ap- 
proach was worked out for field officers based on the experience of the 
pilot survey and in consultation with experts in African customs and 
wi+h educated Africans. 

ahe application of sampling in order to obtain ratio estimates of 
African vital statistics is of great importance in Africa, since it provides 
the only promising method of obtaining sound vital statistics of the 
African population over large areas of the continent. It raises methodo- 
logical difficulties which are not sampling, but demographic problems. 
In Central Africa large numbers of male Africans are absent from vil- 
lages working for Europeans. When the village is the sample, unit birth 
and death rates based on the population in the village on a given day 
are very artificial. The only conception of a birth or death rate which is 
of value, is one based on the population normally domiciled in the vil- 
lage (i.e., de jure population). 

The birth rate per 1,000 women over puberty, however, is not an 
artificial ratio. This was obtained by questioning all the women present 
in the village. The birth rate per 1,000 women over puberty was then 
expressed as a birth rate per 1,000 persons normally domiciled in the 
village by correcting for the ratio of adult women to the total popula- 
tion. Since a sample inquiry obtains the number of births and deaths 
in the 12 months preceding the date of inquiry, it is necessary to correct 
the estimate of the population obteined at the sample date to the mid- 
year population (i.e., to subtract nalf of the births during the past 6 
months and add half of the deaths). These operations involve correcting 
the customary formulae of variance of a ratio. 

In the case of vital statistics the number of districts which had to be 
abandoned was larger than in the other estimates because a larger 
number of field officers were disqualified by the inter-penetrating sam- 
ple test than in the other population attributes and consequently rates 
for provinces sometimes do not cover the whole population. For South- 
ern Rhodesia as a whole the birth and infant mortality rates embrace 
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93 per cent and the death rates 83 per cent of the total African popula- 
tion. In view of the importance of the subject of African Vital Statistics, 
considerable space is directed to the methods employed in the next 
section. 

The system of inter-penetrating samples was valuable in eliminating 
field officers’ errors. It also provided results in cases where for unfore- 
seen reasons (sickness, etc.) a field officer was unable to do his work. 
In general it showed that the work of field officers was good, the largest 
proportion of failures being found in the questions relating to births 
and deaths. The following table shows the number of samples finally 
accepted in each district. 








No. of Inter- No. of Inter- 
penetrating penetrating 
Samples Used Samples Used 
District by District by 
Province Birth and Province Birth and 

infant | Death infant | Death 
mortality | rates mortality | rates 
rates rates 
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13. Chilimanzi 30. Shangani 
14. Gwelo 31. Wankie 
15. Sebungwe Umtali Province: 
16. Selukwe 32. Chipinga 
Victoria Province: 33. Inyanga 
17. Bikita 34. Makoni 
18. Chibi 35. Melsetter 
19. Victoria 36. Umtali 
20. Gutu 
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ESTIMATING PROCEDURE 


Theoretically a birth rate can be obtained by taking a sample of 
villages, finding the number of children born in the last year and the 
total population and working out the birth rate. In actual practice, on 
arriving at the selected village a number of adult women will be in the 
town or visiting elsewhere and cannot be questioned. In the estimated 
birth rates it was assumed that as these absentees were equally likely 
to be present at the time of the visit there was no reason to assume their 
experience would differ significantly from that of the women ques- 
tioned. 

The estimation formula employed was 

Xb, uF. 
1 1 


R= 1,000 
x P—ih x 
LW. 


1 





where 6,=the births to women questioned in villages, W, the adult 
women questioned, F, the total number of adult women domiciled in 
the village, P the total de jure sample population, n the number of 
villages in the sample, and h the natural increase of the district in 12 
months. A more usual conception of a birth rate is provided by the 
formula 


F, 
W 


Db. 
1 s 

R = ——— X 1,000. 
P — th ” 


A test of the difference in the results given by the two formulae was 
made with 7 districts selected at random from the 36 districts. Five 
out of the seven districts showed trifling differences in excess and defect 
of the latter formula and the other two showed errors of 4 per cent in 
excess and defect. The provincial and territorial estimates of the birth 
rates are therefore relatively free from any bias error from the estima- 
tion formula. 
The formula used for the variance of the death rate was 


V(*) = : — nS 





SAMPLING SURVEYS IN CENTRAL AFRICA 


where 
S? = i (y. — 72,.)? +n —1, 
: ; 


y=number of deaths per village, x=total population per village, 
7=)1Ye/D 10s or the death rate in terms of unity, f the sampling frac- 
tion, and N the number of villages in the sample [8,9]. Some adjustment 
in > 1ye/)_12. had to be made to correct the death rate tothe estimated 
mid-year population. This adjustment was made on the strength of the 
sample results, but the correction which should have been made in the 
formula for the variance of this adjustment factor was ignored. It is 
not considered that it would greatly affect the variances which have 
been published. 

For all estimates grouping several districts the sample variances of 
the districts were multiplied by the square of the weights used to com- 
bine them and then added together. These weights were determined 
from the sample. The variance of all rates expressed per 1,000 are 
multiplied by 1,000?. 

Since the women present in a village and questioned at the time of 
the field officer’s visit were equally likely to have been absent, the 
variance formula employed for the birth rate was regarded as equiva- 
lent to that of a two stage sample. The following formula was employed 


mr \ 
Var (R) = - var (r) 


P- 4 


where n was the number of villages sampled, F, the total number of 
women in village S, P the total population in the district, h the net 
increase in the district during the 12 months, and r the estimated ratio 
of births per adult woman questioned (i.c., )-;b./>_;W.). The variance 
of r was the two term formula 
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where in addition to the preceding symbols, 7 was the total number of 
villages in the district W, the number of women questioned in Village 
S, Q:=>-3(b.—rW,)? b, the number of births in village S, r, the birth 
ratio per adult woman in village S, and g,=(1—r,). 

The ratio used to estimate infant mortality was r.=)>_1d./)_10. where 
d, is the deaths and 8, the births in the village S. Since the sample se- 
lection was systematic, this rate should be automatically weighted by 
relative sizes of village in the district but the estimate contains a bias 
error in so far as the ratio of the women questioned to the total women 
in the village was not constant from village to village. Inspection of the 
returns did not indicate correlation (positive or negative) between the 
numbers questioned per village and the size of infant mortality per 
village. The variance formula employed was 


n 
1 —-—nQ, 





Var (r.) = 


m9 ( 2.) 


where, in addition to the former symbols, r, was the infant mortality 
rate and Q.=)>_3(d.—reb.) 22 


SAMPLE VARIABILITY 


The extent to which a sample error of twice the standard deviation 
(representing 10 per cent of the district population) was attained is 
shown in the following table. In some cases, failure to attain the figure 
was due to the falling out of a field officer or disqualification when his 
work was tested by the inter-penetrating samples. 

For each district the survey was planned to yield an absolute error 
equal to 10 per cent of the existing estimates of the district African 
population and which would be equal to twice the sample error of the 
survey. It followed that when the sample figures were determined, the 
actual limits frequently differed from 10 per cent of the sample esti- 
mates, such differences depending on the accuracy of the pre-existing 
population estimates. 

The large sample error for Umtali was caused by an accidental omis- 
sion of 81 villages from the frame of 661 villages and as a result a really 
satisfactory limit could not be determined. In general, the variability 
of the villages was underestimated, but it has since been calculated 
that a similar size sample using a ratio with supplementary information 
for estimation or a simple division of the villages into 3 strata by size 
of village would considerably reduce the sample errors. 
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SAMPLE ERRORS OF POPULATION ESTIMATES (MULTIPLIED BY 
TWO) BY PROVINCES AND NATIVE DISTRICTS, 1948 





—_— 





Sample Error *2 Sample Error X2 








Administration Per cent Administration Per cent 
Area Ab- of Area Ab- of 


solute solute 


estimate estimate 





Southern Rhodesia | 44,800 2.8 
Districts: 
Provinces: Hartley 
Salisbury 18,800 ‘ Insiza 
ywelo 17,200 , Inyanga 
Victoria 15,600 j Lomagundi 
Bulawayo 21,500 . Makoni 
Umtali 25 ,600 ‘ Marandellas 
Districts: Matobo 
Belingwe — Mazoe 
Bikita 4,600 Melsetter 
Bubi 3,400 l Mrewa 
Buhera 6,700 Mtobo 
Bulalima- Ndanga 
Mangwe 8 ,600 Nuanetsi 
Bulawayo 6,500 Nyamandhlovu 
Charter 8 ,000 Salisbury 
Chibi 5,200 Sebungwe 
Chilimanzi 4,200 Selukwe 
Chipinga 8,400 Shangani 
Darwin 5,100 Umtali 
Gutu 11,700 Urungwe 
Gwanda 11,400 Victoria 
Gwelo 7,000 Wankie 


























NON-SAMPLING ERRORS 


There is no known reason why the estimates of death rates should 
contain biassed errors. The type of non-sampling error which would 
escape detection by inter-penetrating samples would arise from some 
common failure among the Africans to give field officers in the same 
district correct information. This can be detected by tests of consistency 
and by external knowledge. The natural rate of increase determined 
by the sample is consistent with the known rate of increase of numbers 
in the tax register. In some individual districts deaths appear under- 
stated. Among these are Lomagundi, Selukwe, Nuanetsi, Bubi; com- 
parison of these with infant mortality rates suggests that in the case of 
each of these districts, other than Lomagundi, infant mortality is 
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1948 SAMPLE CENSUS: BIRTH, DEATH, AND INFANT MORTALITY RATES OF 
THE AFRICAN POPULATION OF SOUTHERN RHODESIA* 








Infant Mor- 


Birth Rate Death Rate tality Rate 


Natural 


Increase 
Province and ‘ Deaths 
District Births | sample | Deaths | sample nod per | Sample 
per per 1,000 
error 000 error Seemann 1,000 error 
1,000 x2 1, x2 Live x2 
persons Persons Births 








Saispury PROVINCE: 

District: 
Darwin 
Hartley 
Lomagundi 
Marandellas 
Mazoe 
Mrewa 
Mtoko 
Salisbury 
Urunge 
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GwELo PROVINCE: 

District: 
Belingwe 
Buhera 
Charter 
Chilimanzi 
Gwelo 
Sebungwe 
Selukwe 





GwELo PRovINCE 





Victoria PROVINCE: 
District: 
Bikita 
Chibi 
Victoria 
Gutu 
Ndanga 
Nuanetsi 





Victoria PRovINcE 





BuLAWAYO PROVINCE: 
District: 

Bulawayo 
Bubi 
Bulalima-Mangwe 
Gwanda 
Insiza 
Matobo 
Nyamandhlovu 
Shangani 
Wankie 
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Infant Mor- 


Birth Rate Death Rate tality Rate 








Province and , Deaths 
District Births Sample Deaths Sample per Sample 
par error pet error error 
=| | . 
TSO 





BuLAWAYO PROVINCE ° . . id J 49 





UmTaL! PROVINCE 
District: 
Chipinga . . . J i 26 
Inyanga ° ° ° m ‘ 41 
Makoni ‘ i ~ Z . 98 
Melsetter ® J J + 22 
Unmtali . , J 6 64 





UmTaLi PRovINCE . . . , s 33 























SovuTHERN RHODESIA . 1.5 18.1 J " 131 15 








* The rates and errors for Provinces and the Colony relate to the total population of the area 
sampled, and confirmed by the inter-penetrating samples. 

® Samples abandoned for death rates because the field officers concerned did not prove satis- 
factory in this section of the inquiry. 

> No sample taken. 


understated. In Lomagundi, the low death rate if not accepted, must 
be attributed to an understatement of non-infant deaths. The excep- 
tionally high death rates in some areas is due to epidemics. In Wankie 
and Shangani there was a serious epidemic of smallpox. The explana- 
tion of variations in death rates from district to district await more in- 
tensive study. The true test of the non-sampling errors will come when 
the survey has been repeated and for this reason the district samples 
will be increased in the next survey. In the meantime, no reasons have 
been advanced either by medical or administrative officials for believing 
that Africans in any given district would consistently give wrong in- 
formation to the field officers. 

In the case of births it may be argued that women present and ques- 
tioned are more likely to be mothers than absentees, and that conse- 
quently birth rates are overstated. Against this is the fact that African 
women work within a very short time of delivery. It is, however, 
possible that the birth rates have an upward bias. This possibility can 
be finally checked only after a repetition of the survey based on larger 
district samples and a sample survey of African women in urban areas. 
Comparison with similar surveys in Northern Rhodesia and East 
Africa do not suggest any serious non-sampling errors in the results [6]. 

Examination of the infant mortality rates indicates that in certain 
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districts infant mortality was understated. Of these the most impor- 
tant are in Gwelo Province, Nuanetsi, Bulawayo, Bubi, Gwanda, 
Matobo. No satisfactory reason why the African women should have 
failed to give the correct information in these districts has been dis- 
covered. 


RESULTS OF SURVEY 


In view of the importance of this subject not only to Southern 
Rhodesia, but to Africa and in all international studies, the table of 
vital statistics provided by the census and twice the sample errors are 
shown in the table on the next page. 


PUBLICATION OF RESULTS AND FUTURE ACTIVITIES 


Time is of vital importance in demographic surveys because the pop- 
ulation characteristics change from day to day and in the absence of a 
Census on a given night it is necessary to approach this ideal as closely 
as possible by reducing the field work to a minimum period. The field 
work started 1st August and in some cases extended into October, but 
the bulk of the work was completed in August, and it seems reasonable 
to regard the Census as giving the situation at 3lst August, 1948. No 
Census of Population could have been done in so short a time, orso 
cheaply. Furthermore, the intensive work which field workers were 
enabled to do through the time saved by the sample method placed the 
quality of the results well above that to be expected from a complete 
coverage and enabled subjects to be attempted which otherwise would 
not have been contemplated. 

& The preliminary results of the survey were published on 7th De- 
cember, 1948 in the Economic and Statistical Bulletin of Southern 
Rhodesia, roughly four months after the date of the Census [3, 14]. Re- 
vised estimates of the African Population by Native District, Sex, and 
Age Groups were published in the Bulletin of 21st February, 1949 [6] 
and the vital statistics were published in July 1949 [7, 16]. Full details 
of the survey including instructions to field officers, forms, etc., used are 
contained in the final report published in July, 1951 [4]. To state that 
had a full Census been successfully taken the results would still be 
coming out piece-meal sufficiently indicates the value of the method. 

Besides the demographic field, sample inquiries into African agricul- 
ture are being organized in Central Africa. In 1949, a Sample Census of 
African Agriculture was organized in Southern Rhodesia and the work 
benefited greatly from the experience gained from the demographic 








E 1959 


1por- 
nda, 
have 

dis- 


hern 
e of 
are 


Is 
re 
at 
De 


|. 
of 








SAMPLING SURVEYS IN CENTRAL AFRICA 253 


Census. A full account of the survey is contained in the report issued in 
August, 1951 [5]. In 1950 a demographic Sample Census was taken in 
Northern Rhodesia and preliminary results have been published [8, 9]. 
These tend to confirm the results of the Southern Rhodesia demo- 
graphic survey since the relationship between the figures is of the order 
of magnitude which might be expected in view of the differences in the 
natural conditions in the two countries. A pilot Sample Survey of 
African Agriculture was also taken in Northern Rhodesia. Full value 
cannot, of course, be derived until these surveys are repeated and a time 
series built up. Proposals have been submitted for holding an annual 
demographic survey in Southern Rhodesia which will deal with one 
third of the country with the object of covering the whole of Southern 
Rhodesia every three years. Again sampling methods are being em- 
ployed to obtain advance estimates of the characteristics of the non- 
African population from the 1951 Census of Population. 
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FACTORS { THE ACCUMULATION OF 
SOCIAL STATISTICS 


SoLoMON FABRICANT 
New York University and National Bureau of Economic Research 


NALYSIS of the factors determining the stock of social statistics may 
be facilitated if we first formulate some general statements con- 
cerning changes in the stock itself. The appropriate viewpoint is, of 
course, that of the student of society. Let us, therefore, refer here to 
publicly available summary observations. Data reposing in private files 
or published only in the form of endless detail on individual economic 
transactions or units, like individual wholesale prices or lists of em- 
ployees of state and local governments, may be processed into addi- 
tions to the stock, but until this is done do not form part of it. 

What has happened to the volume of social statistics in the United 
States during the past 160 years? The following conclusions will all 
seem pretty obvious to statisticians, though they are not less significant 
on that account. 

(1) There has been a vast and fairly continuous expansion in the 
stock of social statistics. There are no simple measures of this stock, 
nor, indeed, is the way to construct them obvious. Yet even a quick 
thumbing through of the Bureau of the Census’ Historical Statistics of 
the United States, 1789-1945 (1949), should be sufficient to support our 
conclusion. Or one may compare the collection of time series in Fluctua- 
tions in American Business, 1790-1860 by W. B. Smith and A. H. Cole 
(1935) with such a collection on the recent period as the appendix to 
A. F. Burns’ essay on “Economic Research and the Keynesian Think- 
ing of Our Times” (26th Annual Report of the National Bureau of 
Economic Research, 1946). The former shows what we have on that 
period better than do contemporary compendiums like Adam Seybert’s 
Statistical Annals (1818), since it includes many original statistics as- 
sembled by the authors; yet it seems skimpy compared with Burns’ 
rich and easy sampler. And the wealth of “survey” data today stands 
in sharp contrast to the few samples of earlier decades. 

(2) Growth has occurred also in the annual accretion to the stock. 
More and more time series have been established to report contempo- 
rary events, and more and more historical series have been constructed 
from the fragmentary data of the past. The “time-period” index to 
Historical Statistics shows, approximately, the combined effect of both 
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developments on the number of series available for various decades.) 
Family and other surveys have become periodic rather than sporadic, 

(3) The composition of the stock of statistics has been improved with 
time. Surveys have come to be made of facets of social life known but 
not yet measured, and of facets newly developed and discovered. Some 
facets formerly mapped only in the large have been plotted in some 
detail. Thus, the bulk of the statistics relating to the opening decades 
of the 19th century are on population, foreign trade and shipping, 
wholesale prices, and the finances of the federal government. Con- 
temporary information covers number and other characteristics of 
business establishments, as well as people ; the volume of domestic trade 
and production, as well as foreign trade; farm prices, retail prices, and 
wage rates, as well as wholesale prices; and the finances of families, 
business enterprises and state and local governments, as well as federal 
finances. The earlier data on population tell little more than number, 
age, sex, and color. Today there is extensive information on birth, 
death, marriage, and divorce rates, family characteristics, occupation, 
education, and so on. And today’s data are cross-classified to a greater 
extent than in the past. The growing number of census inquiries on 
each subject, summarized in C. D. Wright and W. C. Hunt’s History 
and Growth of the United States Census (1900), shows how this major 
source of information has contributed to the increase in detail. 

(4) Observations are available more frequently. The trend in the 
relative number of monthly and quarterly series has been upward. For 
evidence of this it is sufficient to refer to the collection of time series at 
the National Bureau of Economic Research, described briefly in A. F. 
Burns and W. C. Mitchell’s Measuring Business Cycles (1946). And 
income and expenditure surveys, rare in the past, have become com- 
monplace. 

(5) Observations are available more promptly. Statistics are col- 
lected, tabulated and distributed more rapidly than they used to be. 
There is no readily available evidence on this point except for the 
census reports. We can be pretty sure, however, that the current lags 
between event and publication, some of which are given in the report 
on The Statistical Agencies of the Federal Government by F. C. Mills and 
C. D. Long (1949), are shorter than in the past. 

(6) In other respects, too, the quality of our statistics has improved. 
Whatever doubts there may be about the accuracy of our current sta- 





1 The showing is only approximate (and biased upward) partly because the number of terminated 
series is not given, partly because the collection covers the earlier decades rather less adequately than the 
later decades, as the compilers point out. 
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tistics, there can be no doubt that they are on the whole more accurate 
and more representative of the activities they purport to cover than 
were statistics in the past. W. 8S. Holt’s useful report on The Bureau of 
the Census, Its History, Activities and Organization (1929) provides sum- 
mary evidence of improvement in the quality of one very important 
body of statistics. 

Now, what factors underlie the increase in the part of the nation’s 
capital stock with which we are concerned here? Why have these sta- 
tistics been collected and published in ever-increasing quantity and 
ever-improving quality? Because federal, state and local governments 
constitute by far the chief source of social statistics, our question may 
be put primarily with reference to the factors accounting for growth in 
government’s statistical output.’ 

One important reason why our statistics have grown is expansion in 
the nonstatistical functions of government. The sectors of social life 
over which government exercises its various powers have multiplied, 
and more aspects of the life in each sector have come under the sur- 
veillance of government. Increase in the scope of governmental activity 
is a phenomenon observed by all of us. Public health, factory and food 
inspection, charities, unemployment compensation, central banking, 
regulation of security issues, assistance to farmers and home owners, 
roads, and comprehensive river-valley development—these remind us 
of forms of governmental activity that have come on the scene or been 
greatly expanded in recent decades. The trend is one of long standing, 
however, as is evidenced by the formulation before the close of the 
nineteenth century of Adolph Wagner’s “law of increasing State activi- 
ties.” The list of government activities has lengthened to the point 
where its contents, and changes in them, make substantial publica- 
tions.’ In each of its activities the government keeps records and 
gathers special information for use in current operations, planning, and 
budget preparation and justification. When released (though why they 
are released is a further question), these by-products pass into the 
stock of publicly available statistics. 

The nonstatistical activities of government constitute also an im- 
portant demand factor. The growing needs of administrative and regu- 
latory agencies for information in current operations and planning led 





2 Of the 1,070 tables in the 1948 Statistical Abstract only 70 are in whole or even in part the product 
of private agencies. 

* See, for example, the “Inventory of Governmental Activities in the United States” by C. H. 
Chatters and M. L. Hoover (1947); the several reports on the growth of city, county, and state functions, 
of which L. D. Upson’s on Detroit (“The Growth of a City Government,” 1942) is a prototype; and the 
useful set of “service monographs of the U. 8. government” published by the Brookings Institution, 
one of which has already been cited. 
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to the expansion of statistics collected by “public-purpose collection 
agencies” like the Bureau of the Census and the Bureau of Labor Sta- 
tistics. A fair amount of the statistics they gather is in response to 
demands on the part of other government agencies. For example, the 
basic constitutional provision underlying the work of the Bureau of the 
Census was intended to aid in administering another provision of the 
Constitution, that controlling representation, rather than to provide 
statistics. Further, it is in response to the demands of still other ad- 
ministrative agencies that some of the by-product statistics of adminis- 
trative agencies are collected, even in the first instance, or if already 
collected are tabulated and released. 

As governmental activity increased, not only absolutely, but also in 
relation to total activity, new sources of revenue were developed. These 
in turn became important sources of by-product statistics. Statistics of 
Income and the sales tax data are examples. Informational needs of the 
revenue agencies also helped to raise the demand for the general sta- 
tistics collected by the public-purpose collection agencies, and thus con- 
tributed to the expansion of the activities of the latter. Similarly, the 
revenue agencies have collected statistics to meet the needs of other 
government agencies. 

In this way, then, the volume of statistics rose with the growth of 
government activity. This growth led also to better balance in the 
composition of our statistics. The latter improvement came, however, 
because specific needs in the widening activities of government were 
met rather than because the imbalance was felt and the general needs 
for information about dark areas satisfied. 

A special word must be said about the influence of World War I on 
government statistics. Even before that war the trend was toward in- 
crease in the central government’s role in the regulation and adminis- 
tration of economic life. Exercise of that role was bound to point up the 
need for statistics “not only as a record of what had happened,” as 
Professor Mitchell told the American Statistical Association right after 
the war, “but also as a vital factor in planning what should be done.” 
The war threw the country into a situation demanding immediate and 
extensive central planning and administration, and thus precipitated 
the need for more and better statistics. It is not an accident, therefore, 

that so many of our time series, especially monthly series, begin with 
that period. 

The statistical services of government meet not only the demand of 
government itself but also demand on the part of the public. It is this 


4 Quarterly Publications of the American Statistical A iation, March, 1919; reprinted in The 
Backward Art of Spending Money and Other Essays (1937). 
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demand that requires publication of statistics, for governmental needs 
can be, and often are, met within the confines of government without 
publication. What is in back of the public’s demand? 

Fundamental, I think, is Western civilization’s tendency to become 
more and more “peculiarly matter-of-fact,” as Thorstein Veblen put it.5 
Our attitudes and values have been shaped by the overwhelming suc- 
cesses of modern science and technology. We are convinced, in ever- 
increasing measure, that growth and diffusion of scientific—that is, 
matter-of-fact—knowledge is “indefeasibly right and good.” As a result, 
we have come to value the methods of science, of which statistics is 
now one. We see worth in their application to all situations, including 
those in which they have not yet borne fruit. 

Increasing public interest in and demand for social statistics rests, 
then, on the basic premise that the problems of society, as well as of 
natural science and technology, can be solved by the increase and dif- 
fusion of this especially matter-of-fact type of matter-of-fact knowl- 
edge. The whole world now seems to hold that statistics can be useful 
in understanding, assessing, and controlling the operations of society. 

More peculiar to this country is a strong interest in ourselves, stimu- 
lated by our extraordinary growth. Growth in the population, trade, 
industry, and cities of the United States has always been so rapid as to 
command popular attention and admiration. It seems right and natural 
that every census should reveal expansion. Small increases surprise and 
appear prima facie questionable; the Bureau of the Census has fre- 
quently had difficulties with city fathers disappointed with the slow 
growth of their broods. 

A society in which science and technology are transforming its in- 
dustries and institutions as well as stimulating its growth, is one in 
which social problems crop up. These raise specific demands for in- 
formation. The statistical work of the National Monetary Commission 
is one example of response to these needs. Once statistical explorations 
are made, periodic collection frequently follows: consider the history 
of the “Monthly Report on Labor Force.” The current volume of time 
series and survey data being published has in this way been expanded 
and its composition modified. 

Social changes that create problems also often lead to extensions of 
governmental activity to deal with them. From this, as already noted, 
many by-product statistics emerge, of which the great collection of 
Interstate Commerce Commission data is a shining example. 





5 “The Place of Science in Modern Civilization,” The American Journal of Sociology, March, 1906; 
reprinted in The Place of Science in Modern Civilization and Other Essays (1919). 
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Concomitant change occurs also in the nature of business, thereby 
stimulating a demand for statistics on the part of business enterprises, 
Expansion of long-term investments of a type that (unlike land) is 
recognized as subject to obsolescence, has resulted in attempts to widen 
horizons. These create needs for various types of statistics not felt 
necessary in earlier days. Related is growing business demand for sta- 
tistics on the presumption that more and better information can help 
prevent or ameliorate difficulties arising from short-term economic 
change. The great expansion of statistics by the Department of Com- 
merce after World War I was in part founded on this presumption. The 
rise of advertising and national marketing is another cause of increas- 
ing business demand. And surely the expansion of trade-unionism has 
stimulated a demand for statistics by businessmen, as well as by the 
unions and government agencies concerned with industrial relations. 

Development of economic and statistical science also has accom- 
panied—been forced by—social change. These branches of learning 
have had a share in expanding and improving governmental statistics, 
as well as in swelling the demand. From its very birth the American 
Statistical Association has had a significant role in improving and en- 
larging our stock of statistics. The contribution of its members to small 
sample theory and practice is an example of a factor on the supply side. 

Enlarged scientific experience has resulted in displacing the vague 
feeling that factual knowledge is good by a sharper understanding of 
how and in what way statistics can help. And as in other scientific 
work, an important consequence of statistical research and analysis has 
been the discovery of other, as yet uncollected, statistics that are neces- 
sary and desirable. Index numbers, national income accounts, input- 
output tables, and other ways of summarizing and presenting data, 
first developed and put to use by private investigators, have been taken 
over and utilized by the federal government and have thus expanded 
the scope of its statistical output. Realization of this fruitful inter- 
change has provided another reason for publication of what might 
otherwise remain intramural governmental statistics: that is the op- 
portunity afforded for review, assessment, and criticism, as well as use, 
by private agencies and scholars. 

Physical science and technology appear on the supply as well as the 
demand side of our account. Their growth has helped to cut the costs of 
collecting, processing, and publishing statistics and to speed up col- 
lection and transmission. Tabulating equipment, mechanical computers, 
and the telephone are essential statistical tools today, without which 
the rapid presentation of adequate statistics would not be possible. 
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The spread of the pecuniary standard and improvements in records, 
to which government tax and regulatory requirements have contrib- 
uted, has made it possible to obtain better, more uniform, and more 
detailed statistics from families and farmers as well as businessmen. 
Together with increased size of business enterprise and better instru- 
ments of communication this development has made it possible also to 
secure data more speedily. Appreciation of the value of statistics on the 
part of businessmen has also helped to lessen resistance to government 
questionnaires and provide the voluntary cooperation so essential to 
good quality. 

These various factors have also swelled the flow from nongovernmen- 
tal sources of statistics on “aggregates” as distinct from reports on indi- 
vidual economic units. As mentioned earlier, the bulk of our social sta- 
tistics originate in governmental sources, one reason being that the 
marginal social value of statistics exceeds their marginal private value: 
statistics are like lighthouses and street-lamps. Another reason is the 
ensurance of secrecy. But nongovernmental sources like trade associa- 
tions, central markets, and research institutions should not be ignored. 
Their summary statistics contribute to the current flow directly as well 
as through governmentally assembled larger aggregates. Nongovern- 
mental statistics have increased in volume partly because their sources 
have grown in number, and partly because reports from them have 
expanded in scope. Obvious examples are provided by the trade associ- 
ation statistics on production used by the Federal Reserve Board; the 
statistics issued by the New York Stock Exchange; and the series on 
department stores currently compiled at Harvard University. 

It appears, then, that the statistical stock of the country increased 
because of such factors as the widening and deepening of government 
activities, advances in technology, the appearance of new industries 
and new problems, and the growth of trade associations and similar 
organizations. No assessment has been attempted of shifts in the role 
of the several factors in the development of statistics, nor has it usually 
been possible to attach even a rough average weight to each. Our analy- 
sis is not much more concrete than those found in earlier discussions of 
this subject. One reason for this is that the various factors are not in 
fact independent of one another. Are they not all, indeed, characteristic 
features of the economic development of countries in which income 
rises, the main contours of industrial structure are altered in the direc- 
tion expressed by the term “industrialization,” people move to the cities, 
and so on? If we were to summarize the various factors we have con- 
sidered in a few words, it might be in the proposition that the growth 
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of our statistics is itself an aspect of our economic and social develop- 
ment. It has surely been greatly influenced by that development. 

But we should not minimize the influence of other factors that have 
affected our statistical wealth directly, and perhaps also indirectly 
through their influence on the role of government and other factors in 
our development. Thus, one of the great compromises of the Constitu- 
tional Convention, that seats in the House be apportioned by popula- 
tion and therefore that a decennial Census be taken, must have had a 
special influence on the development of statistics in this country, and 
must help explain our high level relative to other countries. The Consti- 
tutional requirement itself is minimal, necessitating only a bare count 
of-each of two groups of the population. But from the beginning it has 
been a vehicle on which more and more has been loaded. Even the first 
Census asked more than the minimum, and when the second was being 
planned, there were already requests for additional information—a de- 
mand that has grown ever since. 

If we are right in our conclusion that the upward trend in the stock 
of statistics is largely a concomitant of economic and social growth, it 
follows that upward trends have characterized the statistics of all coun- 
tries participating in that development. Also, if we were to take a view 
of the world today we would find a significant degree of correlation 
between volume of statistics and degree of development, whether meas- 
ured by income per capita, importance of non-farming industry, or 
otherwise. These inferences would be confirmed by even a quick glance 
at Colin Clark’s Conditions of Economic Progress and the statistical 
publications of the United Nations. 

One implication of this for social science is that plenty of work needs 
to be done to extract and organize information from the meagre records 
of unindustrialized countries, past or present. For a necessary condition 
of improving our understanding of economic development is to know 
more about the countries and periods for which adequate social statis- 
tics are now lacking. 

A final word on the question of economic development, to which we 
have been led. Growth of our statistics has been greatly influenced by 
economic development. In turn, however, statistics have helped to mold 
that development. For the presence of expanded statistical information 
affects economic behavior. It makes possible governmental activity 
(some would say, interference) of a kind not otherwise possible. And it 
influences the behavior of businessmen, who have always been sensitive 
to information. Like other elements in the nation’s capital stock, sta- 
tistics are cause as well as consequence of economic change. Statistics 
play their part in the cumulative process of economic development. 





COMPARISON OF SELECTED MEASURES OF ABILITY 
OF COMMUNITIES TO BEAR TAX BURDENS 


Lorin A. THOMPSON 
University of Virginia 


OcAL governments have been searching for additional revenues to 
L cover increasing cost of services provided. Local taxes have been 
raised by increasing tax rates on property. These increases, however, 
have not always been closely related to the increases in income and 
market value of the property against which the taxes have been as- 
sessed. It is evident, therefore, that in measuring the tax burden of 
local governments, the taxes collected need to be related to a base 
which is more realistic than assessed values. Federal tax burdens for 
sometime have been related to the national income. It therefore seems 
appropriate to apply the same logic to the measurement of tax burdens 
in local areas.! 

The correlation of estimated income in Virginia cities and counties 
in time is indicated by the following coefficients of correlation for cer- 
tain pairs of years: 


1930 vs. 1940 r = 0.93 
1930 vs. 1947 r = 0.86 
1940 vs. 1947 r = 0.89 


In contrast, the correlations between true tax rates? for Virginia cities 
and counties for selected years are: 


1930 vs. 1947 r = 0.49 
1936 vs. 1947 r = 0.63 
1939 vs. 1947 r = 0.68 
1942 vs. 1947 r = 0.69 
1944 vs. 1947 r = 0.84 





1 The estimates of income for local areas used here were taken from studies by members of the 
University of Virginia Bureau of Population and Economic Research: Lancaster, J. L., “Per Capita 
Income of Virginie Counties,” University of Virginia News Letter, Vol. 22, No. 1, October 1945; Gilliam, 
Sara, K., Distribution of Income in Virginia, 1947, Bureau of Population and Economic Research, 
University of Virginia, Charlottesville, Virginia, 1950. 

2 “True tax rate” as used here is the average ratio of assessed value of real estate to actual sales 
price during a celendar year. The validity of this measure in any year is influenced by the extent to 
which the properties sold constitute an appropriate sample of all properties assessed. 
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The correlations of per capita income are significantly higher than the 
correlations of true tax rates for different pairs of years. 

In Table 1 several selected factors suggested as measures of economic 
ability to bear tax burdens are compared with per capita income and 
one another for the year 1947. 


TABLE 1 


CORRELATIONS AMONG SEVEN VARIABLES RELATED TO THE 
TAX LOAD OF VIRGINIA COUNTIES AND CITIES, 1947 








2 3 4 5 6 j 
Per Per Per Per Per Per 
Variable Capita Capita Capita Capita Capita Capita 
True Assessed Local State Federal Total 
Wealth Wealth Tax Tax Tax Tax 





Per siete Income 0.61 0.57 0.75 0.79 0.82 0.85 

Per Capita Sine Wealth 0.81 0.63 0.68 0.63 0.68 
Per oui Assessed 

Wealth 0.69 0.70 0.60 0.69 

Per Capita Local Tax 0.74 : 0.79 

Per wade State Tax . 0.94 

Per estes Sateen Tax 0.96 





The income estimates were prepared by the Bureau of Population and Economic Research. Per 
capita true wealth figures were prepared by the Virginia Department of Taxation and were based on 
ratios between assessed and market values of such classes of wealth as real estate, tangible personal 
property, and the like. Per capita assessed wealth was obtained by dividing total local assessment by the 
estimated population. The per capita local, state, federal, and total taxes were obtained by dividing the 
amounts of such taxes as could be identified and correctly allocated by the population of the locality. 
The per capita taxes do not represent all taxes paid since the method of allocating all taxes—federal, 
state, and local—is exceedingly complicated. The per capita taxes used are those taken from official 
records which are readily identified. 


The difference between the correlation of per capita true wealth and 
per capita income and that of per capita assessed wealth and income is 
not statistically significant. There is little difference, however, between 
true and assessed wealth as a measure or clue to economic ability. Even 
the correlation between per capita true wealth and per capita assessed 
wealth permits considerable error in predicting true wealth from as- 
sessed wealth. 
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The lower correlation between per capita local tax payments and per 
capita income, as compared with the correlation between state and fed- 
eral taxes and income, may be due to the fact that local taxes are in- 
fluenced more extensively by assessed wealth than are either the state 
or federal taxes. The correlations of per capita true wealth and per 
capita assessed wealth with local, state, and federal taxes are lower 
than corresponding correlations of taxes with per capita income. This 
suggests that despite the fact that appraisal or assessment of wealth is 
the basis of local tax liability, taxes tend to be adjusted to income re- 
gardless of how they are assessed. But use of wealth as a measure of tax 
liability is at best a crude and rough measure of ability and for this 
reason tends to place an uneven burden on different groups of tax- 
payers. 

The relationships between per capita local, state, and federal taxes 
and income and wealth measures become a little clearer if the analysis 
is pursued by computing partial correlations. First, let us compute the 
second order coefficients of correlation between per capita local, state, 
federal, and total taxes versus true wealth and assessed wealth, holding 
income constant in each instance. The results are as follows, with 
variables identified by the numbers in Table 1: 


Ta. = 0.33 Tua = 0.48 
T5.1 = 0.41 735.1 = 0.50 
Tei = 0.29 Te.1 = 0.28 
Tez. = 0.39 T3711 = 0.47 
Next true wealth and assessed wealth respectively are held constant 


and the r values computed between the per capita local, state, federal, 
and total tax rates and per capita income: 


Tus = 0.59 ru.s = 0.60 
T.2 = 0.65 15.3 = 0.67 
rie2 = 0.71 ri6.3 = 0.73 
Ti72 = 0.73 Ti7.3 = 0.77 


These correlations tend to suggest that local, state, and federal taxes 
are actually more closely related to income than to true or assessed 
wealth. There is a slightly higher correlation between the per capita 
tax rates and in¢ome when assessed wealth is constant than when true 
wealth is held constant. This difference, however, is quite small. 
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An examination of per capita income, per capita true wealth, and per 
capita assessed wealth as measures of economic ability among com- 
munities shows that income measures as developed by the Bureau of 
Population and Economic Research have a higher internal consistency 
than the widely used measures of per capita true wealth or assessed 
wealth.? There is some evidence that local tax programs are based in a 
crude way upon income differentials. They are crude because they are 
inelastic and are not responsive to the changes in the level of income. 
There is also some evidence to indicate that all the tax levies—local, 
state, and federal—are more highly correlated with per capita income 
than with true or assessed wealth. 





* Similar results were obtained by R. H. Johnson for Iowa. See pp. 62-65 of his An Analysis of 
Iowa Income Payments by Counties, Bureau of Business and Economic Research, State University of 
Iowa, March, 1950. 





A CRITICAL EVALUATION OF AVAILABLE 
AGRICULTURAL STATISTICS* 


Ivan M. LEB 
University of California, Berkeley 


HE first: systematic attempt to collect agricultural statistics dates 

back more than a century to the Census of 1840.' From that date 
forward an increasing volume of agricultural statistics has been col- 
lected periodically in Census enumerations decennially to 1920 and 
quinquennially thereafter. A rudimentary system of annual agricultural 
estimation was also begun about 1840 in the Patent Office. Upon Com- 
missioner Ellsworth’s resignation in 1845, however, interest in agricul- 
tural statistics subsided in the Patent Office, and it was not until after 
the Department of Agriculture was organized in 1862 that annual inter- 
census estimates were again revived.? Current monthly reports on crop 
conditions also predated the establishment of the Department of Agri- 
culture by a few months. Orange Judd, editor of the American Agricul- 
turalist, published summaries of crop condition reports submitted vol- 
untarily by subscribers to his paper for the five months, May through 
September, 1862. Judd’s efforts were the forerunner to the Depart- 
ment’s program of monthly reports on crop prospects which have been 
issued regularly during the growing season since the first publication 
in July 1863. 

Since 1863, the estimating work of the Department of Agriculture 
has expanded very greatly until today a truly impressive volume of 
agricultural estimates is published on a current basis. A few of the 
important markers along the path of expansion are the annual estimates 
of acreage, yield, production, and value of crops in 1866; annual prices 
in 1867; monthly prices in 1908; quarterly farm wage rates and the 
expanded livestock estimates in the 1920’s; and farm employment in 
the 1930’s. Along with this expansion has come a marked decentraliza- 
tion of the estimating function relegating more and more of the control 
over each stage of the estimating process from the revision of lists 
through processing of data to the state or regional offices. The domi- 
nant position held by the Department’s official estimates currently 





* A paper presented at the annual meeting of the American Statistical Association, Boston, Massa- 
chusetts, December 27, 1951. 

1M. R. Benedict, “Development of Agricultural Statistics in the Bureau of the Census,” Journal 
of Farm Economics, Vol. 21 (November 1939), pp. 735-60. 

? Walter H. Ebhling, “Why the Government Entered the Field of Crop Reporting and Forecasting,” 
Journal of Farm Economics, Vol. 21 (November 1939), pp. 718-34. 

§ Ibid. 
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make these estimates the logical focal point in any attempt at general 
appraisal. 

The substantial expansion in the volume of agricultural estimates has 
not been paralleled by major improvements in estimating methods. 
This is somewhat distressing in view of the significant developments in 
the theory of sample design—particularly in the past twenty years. 
The method which currently predominates involves the use of mailed 
inquiries to voluntary respondents for the collection of basic data and 
an assortment of devices designed to remove bias in the transformation 
of basic data into published estimates. Aside from the introduction of 
geographic stratification in the early 1900’s and periodic efforts to in- 
crease the size of samples, no fundamental changes in the sampling 
techniques underlying the bulk of the current estimates are evident since 
the beginning of the estimating program. Current official descriptions 
of methods employed pay lip service to fundamental weaknesses in 
method. Yet, at least equal prominence is given in the literature to 
official views that the estimates published are of “serviceable accuracy” 
and, although the methods possess theoretical weaknesses, they are in 
the main sound for the practical job of supplying usable estimates. If 
supplemental information in the Department’s files give empirical sup- 
port to this conclusion, research workers and others who use the pub- 
lished agricultural statistics need to be more fully informed since from a 
study of the literature currently available it is difficult for one seriously 
interested in appraisal to avoid the conclusion that the validity of the 
large bulk of estimates published is really not known. 

A complete evaluation of available agricultural statistics must em- 
brace such characteristics as breadth of coverage, geographical detail 
of estimates, timeliness and frequency of releases, etc. It is recognized 
that an appraisal of methods cannot itself be divorced from these con- 
siderations. But considering the present state of the Department’s esti- 
mating program it is the opinion of this writer that the criterion of 
reliability must have top priority over all other criteria of evaluation. 
The choice of optimum methods of estimation in an environment in 
which the estimating agency is faced with limited resources and a 
strong demand for breadth and timeliness of coverage may well dictate 
some sacrifice of precision in the estimates. But it is meaningless to 
suggest that the Department’s present methods somehow approach 
the optimal if the magnitude of the precision sacrificed is not known. 

The most significant appraisals of reliability of the present estimates 
will come from objective, empirical analyses and tests. Unfortunately, 
the paucity of necessary data precludes the possibility of an empirical 
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approach to appraisal in this paper. Neither can a detailed, critical 
review of methods which pays due respect to each of the various supple- 
mental devices employed be compressed into these few pages. However, 
the methods underlying the bulk of the estimates published, whether 
they be acreage, yield, prices, employment, wage rates, etc., are similar 
in essential features and, hence, several observations with regard to 
method are fairly generally valid across the board. Most of the points 
raised in the following pages are not new. Repetition of valid criticism 
is considered appropriate since the apparent confidence placed in the 
estimates within and without the Department leads one to question 
whether the methodological weaknesses have not been too lightly dis- 
missed. It is recognized that the uncertainties regarding the validity of 
estimates implied by the rather general comments which follow must 
in fact apply with unequal force to various specific estimates published. 

In what follows frequent reference is made to objective methods of 
estimation. As used here, this term is interpreted broadly to mean opti- 
mum methods consistent with sound sampling theory. More specifi- 
cally, it implies: (1) random selection of sampling units and the in- 
corporation in the sample design of techniques for dealing with non- 
response; (2) carefully formulated concepts of what is being observed 
in the sample and its relation to what is being estimated; (3) the re- 
striction of sample inquiries regarding such measures as prices received 
by farmers and wage rates to the respondents’ objective observations 
based on his own operations and, thus, the abolition of inquizies which 
require the respondent to estimate magnitudes for which he lacks both 
the necessary data and the appropriate statistical techniques; (4) the 
processing of sample data through formally developed estimating pro- 
cedures which when coupled with efficient sample designs yield esti- 
mates possessing desirable statistical properties; and (5) the inclusion 
along with each published estimate of the appropriate measure of 
sampling error, thus, giving the user of the published estimates mean- 
ingful measures of precision. . 


CRITICAL REVIEW OF METHODS 


Sources of possible distortions in the Department’s estimates are 
traceable to the following three main causes: (1) the selectivity of sam- 
ple response without adequate attention to the problem of nonresponse ; 
(2) problems of definition and concept of what is being measured; and 
(3) the opportunity for the injection of personal judgment by the state 
statistician, members of the Crop Reporting Board, and other profes- 
sional members of the Department at virtually every stage of the esti- 
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mating process from the initial collection of data to the final revision 
of an estimate—perhaps several years later. The first and third of 
these apply generally to virtually all estimates published on a regular 
basis while the second is applicable more specifically to prices, wage 
rates, crop yields, and in a different sense to farm employment esti- 
mates. 


Selectivity in Samples 


From the beginning of crop and livestock estimating, major reliance 
has been placed on mailed questionnaires to voluntary correspondents 
as the method of data collection. General and special mailing lists have 
been built up over the years with revision periodically to drop con- 
sistent nonrespondents from the lists and to add new, promising re- 
porters from previous Rural Mail Carriers’ surveys or from some other 
source. The possible selective character of such lists is obvious. Further- 
more, from the current mailing list for the general crop and price 
schedules as well as for the regularly conducted Rural Mail Carriers’ 
surveys, a 25 to 30 per cent response is apparently typical.‘ Selectivity 
of response, therefore, may stem both from the accumulated selectivity 
of the lists in the state statistician’s files and from the selectivity of 
response to any particular survey. The possible consequences of se- 
lectivity and nonresponse are, of course, well known and are recognized 


by those responsible for the estimates. Selectivity affords opportunity 
for an unknown bias in the final estimate while a high rate of nonre- 
sponse, aside from its implication regarding selectivity, renders effec- 
tive control of sample size and, hence, sampling error net of bias im- 
possible. 


Definition of What Is Being Measured 


Problems of defining what is being measured are apparent in all 
“judgment for the locality” type of questionnaires such as prices re- 
ceived by farmers, farm wage rates, and crop yields, and in a different 
way in some “individual farm” reports; as exhibited specifically in farm 
employment estimates. “Judgment for the locality” surveys represent 
an attempt to estimate a universe parameter through observations on 
a related sample characteristic. This procedure need not be objection- 
able if the relationship between what is being observed and what is 
being estimated is known with measurable error. In the Department’s 
estimates adequate measures of these relationships cannot be presumed 
to exist. In the price schedule, reporters are asked to estimate average 





4U.8. Department of Agriculture. The Agricultural Estimating and Reporting Services of the United 
States Department of Agriculture. Misc. Pub. No. 703, Washington, D. C., December 1949, p. 16. 
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prices received by farmers in their localities reporting a single price for 
each product which is “representative of all transactions.”® Similarly 
wage reporters are asked to report “average rates being paid to hired 
farm labor in your locality.”* What is being estimated, therefore, is not 
what prices and wages actually are but what reporters estimate them 
to be. It is difficult to understand why price and wage reporters are not 
asked to report actual prices paid farmers and actual wages paid farm 
laborers. The prevailing.supporting argument for the judgment inquiry 
seems to rest largely on the belief that superior estimates will thereby 
be obtained from samples which are too small to yield sufficiently pre- 
cise estimates based on direct reports. If samples are too small for 
adequate precision, the solution would seem to lie in increasing the size 
of the sample or otherwise altering the sample design and not in the 
adoption of judgment inquiries, which can serve only to complicate fur- 
ther objective evaluation of error. 

The basic data on which initial estimates of crop yield are based are 
the reporters’ judgments of local crop conditions expressed in per cent 
of normal. A normal condition is described officially in the following 
rather lengthy quotation. 

A normal condition is not an average condition, but a condition above 
average, giving promise of more than an average crop. Furthermore, a 
normal condition does not indicate a perfect crop, or a crop that is or 
promises to be the very largest in quantity and the very best in quality that 
the region reported upon may be considered capable of producing. The 
normal indicates something less than this, and thus comes between the 
average and the possible maximum, being greater than the former and less 
than the latter. The normal may be described as a condition of perfect 
healthfulness, unimpaired by drought, hail, insects, or other injurious 


agency, and with such growth and development as may be reasonably looked 
for under these favorable conditions.’ 


In this instance the reporter is asked not only to judge the average 
crop conditions in his locality but also to quantify this judgment rela- 
tive to what is at best a vaguely defined base. Later in the season con- 
dition reports are apparently supplemented by reports of probable 
yield for localities and state statisticians’ judgments regarding yield 
prospects. 

Of all the estimates made by the Department, crop yields seem most 
amenable to objective measurement and estimation. It is understand- 





§ Ibid., p. 239. 

* Tbid., p. 126. 

7U. 8. Department of Agriculture. The Crop and Livestock Reporting Services of the United States. 
Mise. Pub. No. 171. Washing‘on, D. C., 1933. p. 23. 
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able that early season yield estimates based on objective measurement 
may be subject to relatively wide margins of error because of the addi- 
tional component of error attributable to the relation between early 
season measurements of plant or other characteristics and final yields, 
But if these relations are subject to large errors, there seems no valid 
a priori basis for expecting condition reports to give a more reliable 
base for yield estimation. As the crop matures, yield estimates based 
on objective measurements should become more precise—a claim which 
is also made for the present methods. The objective procedure, how- 
ever, has the advantage that the precision of the resulting estimates is 
measurable. 

The conceptual problems arising in the measurement of employment 
are of a different nature. The reporter is asked to report employment 
on his own farm and is given some fairly specific guides to help him de- 
termine who is and who is not employed by definition. Current sched- 
ules define as employed during the survey week all operators who do 
any work at all, all hired workers who work one hour or more for pay, 
and all unpaid family members who work 15 or more hours. Those who 
have had occasion to work with farm employment data are aware of 
the marked consistent discrepancies between the Department’s current 
estimates and those currently released in the monthly Report of the 
Labor Force by the Bureau of the Census. Although basic differences 
in sampling method underlie the estimates by the two agencies, it is a 
reasonable presumption that a significant part of the discrepancy is 
explainable by conceptual differences in the magnitudes being esti- 
mated. It is not the purpose here to support the Census concepts over 
those used by the Department. A recent article by Johnson and Notten- 
burg, however, contains valid criticism which suggests that employ- 
ment estimates in the form now published are not directly usawle in 
most significant research problems in agricultural economics.® 


Processing of Data 


The opportunity afforded for the injection of personal judgment into 
the estimates at all stages of data processing is clearly revealed in cur- 
rent official descriptions of the procedures employed. Selected direct 
quotations representative of the many which might be cited will serve 
most effectively to depict the subjective nature of the process from the 
initial point of “editing” the data througl. the expansion of sampling 
indications into universe estimates, and finally to the final revisions 
based on supplemental indications. 





8 D. Gale Johnson and Marilyn Corn Nottenburg, “A Critical Analysis of Farm Employment Esti- 
mates.” Journal of the American Statistical Association, Vol. 46 (June 1951), p. 181-205. 
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Editing. Editing is clearly an essential function in the processing of 
data collected from individuals whether it be a sample survey or a 
complete enumeration. The real editing function is served only when 
editing is performed within a formally prescribed set of rules designed 
to rule out unuseable schedules or render them useable where the diffi- 
culty is remediable without distortion through straightforward con- 
versions of units of measurement, etc. A somewhat different concept of 
editing is implied by the following quotation from a recent Department 
of Agriculture publication: 

Editing is necessarily a rather subjective operation, but when done care- 
fully by a technician who is thoroughly familiar with what is being reported, 
the geographical area involved, and the type of frequency distribution that 
can reasonably be expected, it is a safeguard against errors which otherwise 
almost certainly would occur.® 

The highly subjective nature of the “editing” process is illustrated 
by the following description of procedure in the state offices, with spe- 
cific reference to the general crop report. (Italics in all direct quotations 
are mine.) 

(Esch column on the schedule is scanned) to make sure that none of the 
entries therein differs so much from the others as to indicate either a mis- 
understanding on the part of the reporter, or a misplaced entry on the 
schedule, or an error in listing. Even when no error of this sort appears to have 
been made, a report that is very different from the others from a given county 
may be deleted on the basis that it is unrepresentative. Most questionable entries 
are either deleted or moved into the appropriate columns, but some that 
are apparently not attributable to misunderstanding or mechanical errors 
may be left in to represent minority situations in a given county.” 

With reference to price estimates the “editing” function is likewise 
revealed as twofold: “(1) to detect, by inspection, errors made by the 
respondent in reporting prices, or by the clerk, in transcribing the 
data”; and “(2) to review the reasonableness of the quotations reported and 
thetr comparability as a whole with previous reports.”™ 

There is apparently no centralized control designed to establish and 
maintain uniform rules of editing throughout the system, But mere 
lack of uniformity may be considered a minor infraction of sound edit- 
ing practice when compared with the faulty concept of editing which 
provides opportunity for indiscriminate pruning of extreme observa- 
tions to force conformity to preconceived notions regarding the fre- 
quency distribution of observations and geographic and other uni- 
formities. 





®U. S. Department of Agriculture. The Agricultural Estimating and Reporting Services of the 
U.S.D.A. Mise. Pub. No. 703. Washington, D. C., December 1949. p. 11. 

w Tbid., p. 11. 

ul Tbid., p. 116. 
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Initial Estimates. Workers in the Department have long recognized 
the possibility of biased response to the voluntary mailed inquiry and 
have sought to adjust for this bias in the estimating procedures em- 
ployed. In the late 1920’s the graphic regression technique, involving, 
typically, the use of a scatter diagram between current estimates and 
final estimates over some past period was introduced and is used ex- 
tensively today. These regression charts are usually employed in no 
prescribed formal way but are used along with other supplemental 
analyses in arriving at recommended estimates in the state offices and 
in the review of these recommendations by the Crop Reporting Board. 
In the estimation of crop yields from sample reported condition, for 
example, a scatter chart of final yield estimates in a previous period 
against sample reported condition in corresponding years is used as an 
aid in transforming current sample data into an estimate of yield. Other 
factors such as precipitation or trend may also be introduced to permit 
multiple regression adjustments. An important bit of supplemental in- 
formation used in interpreting the sample indications is apparently the 
state statistician’s personal appraisals of yield prospects based on his 
observations as he travels extensively through his state. 

That this procedure carries through the review of the Crop Reporting 
Board is amply evident in the following official description, referring 
again to the crop report. “In the first, or State, review, a member of the 
Crop Reporting Board reads the (state) statistician’s comments and 
using much the same techniques as followed by the statistician (in the 
state), arrives at his own recommendation for each item.” The state re- 
viewer’s recommendations are then submitted to the entire Crop Re- 
porting Board, and “. . . using the same techniques which were used in 
the state office and in the state review, they (the Board) review the 
estimates for all states, approve or disapprove changes made by the 
state reviewer, and make other changes if they believe the data warrant 
them.” As a final step, “All changes are then approved or disapproved by 
the Chairman of the Crop Reporting Board.” 

Similar methods underlie price estimation. At the state level, 
“...the statistician reviews the entire set of data, compares price 
changes of similar or related commodities and in central markets, ad- 
justs for known biases, using regression charts, and makes other com- 
parisons or analyses that will help him evaluate the data.” And in the 
review, “The Board considers the recommendations from each state in 
relation to the indicated averages and the level of reported prices in 
other state, keeping in mind usual geographic differences, relative de- 





12 Tbid., p. 12. 
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gree of change since the previous inquiry, amount of change since the 
previous inquiry, amount of change shown by comparable price data 
from other sources, and available terminal-market price quotations.”” 

The specific nature of the evidence underlying the reviewers’ re- 
visions is not clear. From the description of procedure the possibility 
cannot be excluded that adjustments might be based on little more 
than a belief on the part of the reviewer that his revised estimate is 
somehow superior. Furthermore, the apparent attention given geo- 
graphic and terminal market price behavior may well serve to generate 
significant relationships in the published estimates at this stage. 

Another type of estimate apparently used extensively by the Depart- 
ment is the direct ratio or regression expansion. Claims of superiority 
of this form of estimate are offered along two quite different lines. The 
first is that in samples free from selectivity ratio or regression estimates 
frequently give more precise results than direct estimates. The in- 
creased efficiency is attributable to the fact that additional information, 
namely the relation between two universe variables—the variable, the 
mean of which is being estimated, and the control variable—is intro- 
duced explicitly into the estimating equation and, if the correlation is 
sufficiently large, a significant reduction in the variance of the estimate 
will result. The second and quite different claim is made that if samples 
are selective, this form of estimate when applied to a base relatively 
free from error frequently serves effectively to adjust for sample selec- 
tivity. This implies that a sample which is biased for purposes of direct 
estimates of livestock numbers, for example, may be significantly less 
biased for estimates of the ratio of livestock numbers this year to last 
year. 

The Department’s supporting literature cites no empirical evidence 
for either of these claims. With respect to the first, it must be recog- 
nized that the ratio or regression form of estimate may contain bias the 
magnitude of which is unknown and, hence, the increased efficiency 
suggested by reduction in the sampling variance may lead to a false 
sense of precision. Results of some experiments conducted with selected 
agricultural statistics in Ireland by R. C. Geary" suggest that for large 
samples ratio estimates work quite well. In stratified sampling, how- 
ever, where the ratio is estimated from relatively small samples within 
strata, the implied gain might be misleading. With respect to the claim 
regarding adjustment for sample selectivity, the literature provides no 





8 Tbid., p. 117. 
“4 R. C. Geary, Sampling Methods Applied to Irish Agricultural Statistics. Technical Series. Central 
Statistical Office, Dublin, September 1949. 
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convincing support. Confirmation or denial of this assertion can come 
only through continuing independent checks on individual estimates, 

Revision Process. An integral part of the estimating work of the De- 
partment is the continuing program of estimate revision. This is im- 
portant in the Department’s estimating program since final revised 
estimates play a key role in many of the current estimates. Check data 
of varying degrees of accuracy become available at the close of each 
season, or at periodic intervals, which are used in successive revisions 
towards what is considered to be more accurate estimates. For some 
estimates, fairly complete check data become available at the close of 
the season. This is apparently true for production of such crops as 
cotton, tobacco, sugar beets, rice, peanuts, and flaxseed. For other cash 
crops, check data of varying degrees of completeness become available 
from supplemental indications of railroad movements, terminal market 
receipts, etc. For a number of crops, among them the major feed crops, 
heavy reliance is placed on the quinquennial Census of Agriculture and 
annual farm censuses in states where such censuses are taken. For cer- 
tain other estimates, notably prices, the nature of the supplemental 
information on which revision is based is not clear. , 

Appraisal of the methods employed in the revision process is not 
possible since adequate accounts of the methods used have not been 
published. Current descriptions certainly leave a reasonable doubt as to 
whether the validity of a good deal of the check data is itself really 
known. C. L. Harlan writing with regard to the checks provided by the 
1945 Census of Livestock concluded, “. . . it is apparent that the 1945 
Census enumeration of livestock on farms furnished an inadequate 
bench mark, either for revising the estimates for 1945 and preceding 
years or for projecting the estimates for suceeeding years. But although 
the Census totals in themselves are very questionable as giving a de- 
pendable level of numbers of the different species as of January 1, 
1945, they were of material value to the Department of Agriculture in 
furnishing evidence for making new estimates of numbers that are 
closer to the actual numbers than were the old estimates.”!® The re- 
vised estimates as published are typically much closer to the Depart- 
ment’s original estimate for January 1, 1945 than to the census enumer- 
ation. 

From selected comparisons made by the writer between the Depart- 
ment’s revised estimates and census enumerations of acreage, yield, 
and production, the role played by census data in the revisions is not 





6 C. L. Harlan, “The 1945 Census Enumeration of Livestock on Farms,” Journal of Farm Eco- 
nomics, Vol. 29 (August 1947), pp. 691-710. 
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completely obvious. For those field crops checked, it appears that yield 
estimates were very frequently adjusted to census levels in census 
years. Postcensus adjustments in acreage for census years were also 
evident though discrepancies typically remain between revised acreage 
and census data."® 

The apparent practice of adjusting yield estimates to census levels 
in census years is probably not unreasonable. However, the census 
would seem to give little information which is useful for intercensus 
yield revisions. A census enumeration may call for a yield revision for a 
particular census year but the appropriate pattern of revision for inter- 
census years is not derivable from census yields. In addition something 
must be known regarding the trend of error in yield estimates over 
each period. Postcensus revisions of both yield and acreage for inter- 
census years were evident in the comparisons made but the revision 
procedure is not clarified by an inspection of the data. Those estimates 
for which annual supplemental data become available would appear to 
be on firmer ground for intercensus revision. However, until a great 
deal more is published concerning the precise nature of the supplemen- 
tal data and the analyses underlying the Department’s adjustments 
based on them, the user of the final statistics is at a loss to form even a 
subjective evaluation of the validity of the final estimates. 

Failure of the Department’s revisions of such magnitudes as acreage 
and livestock numbers to conform closely to census levels cannot be 
considered to reflect upon the accuracy of the revised figures. The diffi- 
culties in interpreting census data arising from such causes as under- 
enumeration, lack of comparability due to shifting dates of enumera- 
tion, unknown interviewer error, etc., are familiar to all who have had 
occasion to use census data in research. It may be questioned, however, 
whether the cloud of uncertainty with regard to the validity of the 
original estimates is dispelled by references in the literature to census 
“bench marks” used in the revisions of these estimates. It is perhaps 
not inappropriate here to express the view that for a government other- 
wise economy minded with regard to the collection of statistics, the 
census seems an extremely inefficient method of collecting information. 
Recognizing that census data are neither timely nor free from error, it 





% The comparisons were with 1934, 1939, and 1944 Census data as follows: 
Winter wheat: Oklahoma, Kansas, United States. 
Spring wheat: North Dakota, South Dakota, United States. 
Corn: Iowa, Illinois, United States. 
Oats: Iowa, Ohio, United States. 
Barley: North Dakota, California, United States. 
Rye: North Dakota, Minnesota, United States 
Soybeans (excluding 1934): Illinois, Indiana, United States. 
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must be questioned whether much, if any, information is provided by 
the census that could not be obtained more efficiently by properly 
designed sample surveys. 


CONCLUDING REMARKS 


Considering the procedures employed by the Department from the 
initial collection of data through the final revisions of estimates, it is 
clear that the usual measures of sampling precision are not applicable 
for the bulk of the estimates made. Moreover, the user of these statis- 
tics is supplied with little convincing evidence regarding their accuracy, 
Assurances in the supporting literature that estimates are felt to be of 
“serviceable accuracy” are somewhat less convincing than would be a 
series of checks based on objectively designed surveys and objective 
procedures of processing the data. If continuing independent checks 
could be found to support the official estimates, a more rational basis 
for the continuance of present methods might be found. 

The more objective surveys which have been conducted by the De- 
partment have generally been designed for special estimates not ob- 
tainable in the regular estimating program and not specifically as 
checks on estimates by regular methods. Although one purpose of the 
1947 and 1948 enumerative surveys was to provide objective checks on 
official estimates at the national and broad regional levels, they have 
apparently not provided conclusive checks. Such comparisons as are 
available in print, however, do not show generally close agreement with 
official estimates.'? Published accounts of objective experiments at the 
state level with acreage, yield, and production estimates in such crops 
as wheat, cotton, and corn likewise give no conclusive checks on official 
estimates.'* The wheat surveys of 1939 and 1940 have been the most 
comprehensive, but checks on official estimates are clouded by the pos- 
sibilities of bias in the objective estimates themselves. Objective counts 
as applied in cotton are apparently viewed more as supplemental indi- 
cations than as checks on the estimates made by regular methods. An 
interesting experiment involving samples drawn from independent lists 
and an interview follow-up of a sample of nonrespondents conducted in 





17 Catherine Senf, “A Report on the General Enumerative Surveys II,” Agricultural Economic Re- 
search, Vol. 1 (October 1949), pp. 105-28. 

18 Arnold J. King and Dale E. McCarty, An Objective Method of Sampling Wheat Fields to Estimate 
Production and Quality of Wheat. U. 8. Department of Agriculture. Technical Bulletin No. 814. Washing- 
ton, D. C. February 1942. F 

D. A. McCandless, “Objective Sampling in Estimating Southern Crops,” Journal of Farm Eco- 
nomics, Vol. 23 (February 1941), pp. 247-55. 

U. 8. Department of Agriculture, Agricultural Estimating and Reporting Services of the United 
States. Washington, D. C., p. 18. 
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some 25 states in 1948 suggested a rather consistent upward bias in 
estimates of April 1 grain stocks based only on response to the general 
schedule typically used for this purpose.!® Other experiments in the 
form of special mailed surveys with repeated mailings or follow-up in- 
terviews of a sample of nonrespondents have provided interesting re- 
sults which reflect the nature of the biases present in voluntary mailed 
inquiries.”° ; 

This experimentation in more objective methods has no doubt 
yielded suggestive results but the function which it has served in the 
regular estimating program of the Department is not clear. It is clear 
that from the assortment of objective experiments thus far conducted 
by the Department there emerges no systematic plan of objective 
checks on the large volume of important current estimates released. 

Lacking such objective checks, the user of the statistics would be 
facilitated somewhat in his individual appraisal of the estimates by 
more detailed analytical accounts of the adjustments made in the proc- 
essing of data from the initial collection to the final revised estimates. 
Analyses revealing the nature of and the basis for adjustments by state 
statisticians and members of the Crop Reporting Board might well be 
useful in appraisal of initial estimates. More complete reports of analy- 
ses of revision procedures are essential to the appraisal of revised esti- 
mates. 

It must be considered doubtful, however, that more exhaustive anal- 
yses of estimates based on present methods will, in many cases, give the 
user more than a slightly more adequate basis for his own subjective 
appraisal of their accuracy. Although this is to be preferred to the 
present lack of supporting analyses, it must be considered as an inferior 
substitute for a shift to more objective methods. 

The limiting factors most frequently cited to a shift to more objective 
methods are the familiar ones of time and costs. It cannot be denied 
that both elements are real. Yet, it is difficult for an outsider to deter- 
mine to what extent the absence of significant progress in this direction 
is motivated by an underlying conviction that the methods currently 
used are sound. Objective research demonstrating clearly the need for 
improved methods might prove the most effective device for making 





19 J. C. Scholl and C. E. Burkhead, “Interviewing Nonrespondents to a Mail Survey: An Experi- 
ment in Connection With April 1948 Farm Stocks Report,” Agricultural Economics Research, Vol. 1 
(January 1949), pp. 16-23. 

® Walter A. Hendricks, “Adjustment for Bias Caused by Non-Response in Mailed Survey,” Agri- 
cultural Economics Research, Vol. 1 (April 1949), pp. 52-6. 

George Knutson, “Winter Storm Livestock Loss Surveys in Wyoming,” Journal of Farm Eco- 
nomics, Vol. 31 (November 1949), pp. 1276-1283. 
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both more time and more funds available. The time element may be 
somewhat overstressed. A premium might be placed on sample designs 
which would facilitate data processing in the time allotted though per- 
haps at some sacrifice of precision. Also, with more objective pro- 
cedures, the time now spent in observation and review by the profes- 
sional staff would be somewhat curtailed. It is even conceivable that 
the present rigid statutory regulations governing time of release of 
many of the estimates might be relaxed if the need were clearly estab- 
lished. 

There is little doubt that a complete shift to objective estimation 
would be costly, but the economy argument should not be overplayed. 
Presumably farmers, handlers, and processors base important economic 
decisions on official estimates. Furthermore, government action pro- 
grams involving substantial expenditures of public funds are tied to the 
Department’s official statistics. In view of the increasingly important 
role being played by official statistics, the time for basic and thorough 
reconsideration of estimating methods seems already past due. 

Cost and other considerations probably mean that a complete shift 
to objective estimation will take time. During the interim period, a 
systematic program of periodic checks should be considered as a less 
costly alternative though perhaps at considerable sacrifice of informa- 
tion. Should resource limitations prohibit the immediate inauguration 
of an extensive program of periodic checks, a program of continuing 
objective checks on key estimates in selected states would seem the 
minimum acceptable if genuine progress in estimating methods is to be 
made. Although this more limited program is of questionable value for 
immediate improvement in estimates nationally, it does have the merit 
of providing really significant checks on present estimating proce- 
dures—a logical starting point for any program of improvement. 














ns 
T- 
O- 
S- 
at 
of 
)- 


' Qo HY ot 


we Sl 





PREPAID MEDICAL CARE AS A SOURCE OF 
MORBIDITY DATA* 


Neva R. DEARDORFF 
Health Insurance Plan of Greater New York 


COMPREHENSIVE MEDICAL CARE MAKES POSSIBLE 
COMPLETE OBSERVATION 


HE potentialities of voluntary prepaid medical care as a source of 

morbidity data might have been a better phrasing of our titie, since 
prepaid medical care has, for a variety of reasons, not yet produced 
morbidity data of any broad general significance. But insofar as volun- 
tary prepaid medical care in a given community covers illness of all 
types without deterrence, reports its observations accurately, and 
reaches a fair cross-section of the population or of an identifiable seg- 
ment of it, the capacity of prepaid medical care to bring into clearer 
view the extent and character of the illness prevalent in that com- 
munity would seem to be substantial and distinctive. 

In order to come as quickly as possible to a practical sense of these 
potentialities, an account is given of the services performed over a 
period of three years for a family of four persons covered by a plan for 
prepaid medical care, the Health Insurance Plan of Greater New York, 
popularly known as HIP. To make this material understandable, it is 
necessary to explain the system of medical care of which this is an 
example and the statistical system by means of which data have been 
and are being accumulated. Thereafter, we shall present some general 
comments on the statistical problems attending the compilation of 
morbidity data from this source as well as some suggestions bearing on 
methods of handling these materials as they relate to persons and 
families. 

AN ILLUSTRATIVE CASE 


First as to the family case. It was originally chosen as typical only 
in the sense that it received care in all of the categories provided under 
the Plan—services that were preventive, purely diagnostic, and reme- 
dial for diseases of varying degrees of seriousness, and were rendered in 
the home, in the physician’s office, and in the hospital. There was of 
course no thought that a single case would typify the services required 
or the illness experienced by families generally over a three-year period. 





* Presented before the session on Morbidity Statistics, American Statistical Association, meeting 
in Chicago, December 27, 1950. 
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But, as we shall see, this family’s experience cannot be regarded ag 
essentially atypical or extraordinary. The justification for dubbing it 
the “Rankand Fyle Family” will become evident as the conditions 
giving rise to the need for care are reviewed. With the exception of this 
generic name, all other details are precisely as they appear in the HIP 
utilization records. 

HIP enrollment was opened to New York City employees in the 
spring of 1947 with services scheduled to begin for them on April first of 
that year. Mr. Rankand Fyle entered at once. He was then forty-four 
years old and receiving a salary of $4,000 per year. His wife was thirty- 
nine, his older boy twelve and his little boy four. 

Altogether during the first three years of their coverage the four 
members of the Rankand Fyle family were seen eighty times by their 
family doctor and the specialists to whom he referred them. This did 
not include the services of radiologists and pathologists. In the first 
year they required forty-seven physician services, the second twenty, 
and the third year thirteen. Throughout the three year period, one 
member had forty-seven physician services, another sixteen, the third 
eleven, and the fourth, six services. In looking back over their experi- 
ence the Rankand Fyles report themselves as more than satisfied with 
the care they have received. Their doctors report that this family used 
HIP exactly as it was hoped that families would; they did not abuse 
their privileges in any way. 

Upon enrollment, Mr. and Mrs. Fyle who live in an outlying borough, 
chose a typical HIP medical group convenient to their home; within 
two weeks they had picked out a family physician to whom Mrs. Fyle 
promptly brought the boys for a general check-up. The older boy, 
Mark, had nothing the matter with him except ingrown toenails that 
could be easily remedied by the doctor. But the little boy, Alan, had a 
congenital inguinal hernia on the right side. At that time he also had a 
cold. 

The family physician arranged at once for the surgeon of the medical 
group to see Alan. This occurred five days later when an operation was 
decided upon. Meanwhile, the smallpox scare had spread throughout 
New York City so Mrs. Fyle and Alan were vaccinated by the family 
physician. By the end of April Alan had his operation. No problems 
arose and in six days he was discharged from the hospital. But just to 
make sure that everything was all right, the surgeon visited him at 
home the day after he left the hospital and six weeks later gave him a 
final post-operative check. Alan, of course, remained a patient of the 
family physician who also saw him after he came home from the hospi- 
tal. 
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Early in the summer Mark, the older boy, ran into a couple of mis- 
haps. In June he injured a finger and early in July contracted trench 
mouth. Later in July the doctor examined him preparatory to going to 
camp. In the fall when he started back to school, his eyes needed at- 
tention. Toward the end of the first year he had a cyst on his breast 
and repeated attacks of tonsillitis. The doctors decided that he should 
have his tonsils and adenoids “out” and this was done. 

In the autumn of this first year the family doctor thought Alan 
should be under the care of a pediatrician. After some observation this 
specialist thought that Alan’s condition suggested rheumatic fever. He 
was hospitalized for a week with the result that such a possibility was 
ruled out, much to the relief of his parents. 

Mr. Fyle needed to see the family doctor only twice during the first 
year, once for a small infection in his ear and once for an infection in 
his eyes. Mrs. Fyle was not only vaccinated, but also had her eyes 
tested and saw the family doctor for muscular pains and a sacroiliac 
strain. 

Altogether, during this first year Alan had thirty physician services, 
Mark eleven, Mrs. Fyle four, and Mr. Fyle two. 

During the second year Alan, still the heaviest user, had nine serv- 
ices, Mark had three, and the parents each had four. The doctors not 
only watched the side on which Alan had been operated, but also noted 
that there was evidence of hernia on the other side. It was decided by 
the family physician and the surgeon that an operation at this time was 
not required. However, Alan began to have troublesome colds and 
tonsillitis. In the early surnamer he skinned his knees and shin and had 
to be taken to the family doctor. Later he fell down and bumped his 
head so hard, the doctor had to be called to the house. 

Mark needed post-operative care after the tonsillectomy performed 
at the end of the first year. Later he developed impetigo and suffered a 
slight injury to his eye. Mr. Fyle came to the doctor for trouble with his 
foot—a bony spur on his heel with muscular pains. He also had a rash 
which the dermatologist diagnosed as follicular impetigo. Mrs. Fyle 
developed some uncomfortable but not dangerous conditions—spastic 
colon, hemorrhoids, and early symptoms of the menopause. At this 
time the family physician gave her a very careful examination and 
arranged for several special laboratory tests including blood counts. 

In the third year Alan’s continued colds led to his having an opera- 
tion for diseased tonsils and adenoids. He also had chickenpox and a 
stomach upset. The pediatrician continued general supervision. Alto- 
gether Alan had eight physician services. Mark needed only two serv- 
ices including a camp examination. Mr. Fyle needed no services at all. 
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Mrs. Fyle sprained her ankle and continued under the doctor’s general 
observation. During this time she had an electrocardiogram. 

Of the eighty physicians’ services :eceived by the Rankand Fyles 
during the three year period, thirty-seven were rendered by the general 
physician, sixteen by the pediatrician, fifteen by the surgeon, six by the 
nose and throat man, four by the eye man, one by the orthopedic 
surgeon, and one by the dermatologist. Among these services was one 
major operation, two minor operations, seventeen hospital visits by 
specialists and one by the general physician, eight home calls (two by 
specialists), twenty-one calls at the several specialists’ offices and thirty 
office calls to the general physician. As was said above, this count does 
not include laboratory and possible X-ray services rendered outside the 
hospital which are also covered by HIP. 

As has been suggested, there ic no reason to believe that the Rankand 
Fyles will, in the long run, prove to be atypical in their demand for 
service. Another three years at the rate of utilization of their third year, 
would bring them for a six year period of coverage almost to the current 
utilization rate of the total HIP enrollment. Moreover, even within 
these first three years, this family was no threat to the actuarial sound- 
ness of HIP. In its original planning, an annual average of seven physi- 
cian services per enrollee was anticipated. This would allow the Rank- 
and Fyles eighty-four services in three years. The eighty physician 
services described above together with the laboratory services per- 
formed outside the hospital came within that allowance. 


THE HIP sysTEM OF MEDICAL CARE 


Only a little more need be said about the HIP medical program as a 
source from which morbidity data can be derived. The care provided is 
comprehensive in the sense that all conditions, both physical and 
mental, are covered for the establishment of a diagnosis. It includes 
treatment for all conditions with the exception of those needing care in 
institutions other than hospitals for general care and those for which a 
psychiatrist is required. Conditions covered under the laws for Work- 
men’s Compensation and for veterans are not included as care is other- 
wise provided. Acute alcoholism is excluded but chronic alcoholism is 
not. Purely cosmetic surgery will not be provided but restorative plastic 
surgery comes within the program. Treatment for rare diseases and 
those requiring unusual skills (brain and cardiac surgery, fenestration 
operations, maxillary and thoracic surgery) are fully covered. 

All enrollees must have hospital insurance so that there will be no 
barrier to hospital service in case there is need of it. 





PREPAID MEDICAL CARE AS A SOURCE OF MORBIDITY DATA 285 


The medical services are provided through autonomous groups of 
physicians serving under the regulations and guidance of a central 
Medical Control Board. The medical groups are composed of general 
family physicians and specialists from the twelve basic medical spe- 
cialties. A central panel of highly qualified physicians takes care of the 
cases presenting rare and unusual conditions. 

A monthly capitation is paid to the medical group for each enrollee 
who selects it for his medical care. 

HIP enrollees stricken away from home with illness or accident seri- 
ous enough to require hospital care, are indemnified for medical service 
up to a maximum of $150. 

The current premium schedule for family coverage calls for an an- 
nual rate of $34.56 for one person, $69.12 for two, and $103.68 for a 
family of three or more persons. This provides an average premium 
return to HIP of approximately $29.50 per person per year. The em- 
ployer is required to pay half of the premium, or, as in the case of the 
union welfare funds, all of it. Mr. Rankand Fyle now pays $1.45 a week 
for his HIP and hospital insurance. His employer, the City of New 
York, pays a like sum. 

As of September 30, 1950, HIP had 251,306 persons enrolled with 
twenty-nine medical groups and another 1,397 persons who work for 
contracting employers or unions in New York City, but who do not 
live in the area served by medical groups. These latter persons are 
covered on an indemnity basis. 

It must be obvious that the conditions of coverage and benefits under 
plans for prepaid medical care will dstermine the conditions and the 
extent to which people are brought under medical observation. Waiting 
periods, the exclusion of pre-existing conditions and early visits to the 
physician, and extra payments, all tend to restrict the observation of 
illness and hence limit the conditions of morbidity that will be reported. 
With HIP, such limitation is minimal as can be seen in the case of the 
Rankand Fyle family. It must also be obvious that observation will be 
influenced by the ways in which physicians are organized to make and 
record observations. 


THE SYSTEM FOR THE COLLECTION OF DEMOGRAPHIC AND MORBIDITY 
DATA ON HIP ENROLLEES 


Meticulously careful records of the enrollment—the population 
base—must be maintained at all times because both the HIP central 
office and the medical groups must know currently the status of the 
person or family eligible for care. In that respect a service plan differs 
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sharply from an indemnity company which can argue after the service 
is rendered as to the status of the insured and its liability for payment. 

Upon enrollment each HIP subscriber fills out a card giving his name, 
address, age, sex, marital status and the name, age, sex, and relation to 
him of each person to be covered by his premium payment.! He is re- 
quired to cover his spouse and all eligible children under eighteen years 
of age in his household if he covers any dependents. He must designate 
the medical group that he has selected to serve him and his family. The 
enrollment card also shows other items such as the employer group to 
which the subscriber belongs, the effective date of enrollment, several 
details about the payroll deduction that he authorizes, and other data 
necessary for accounting purposes. The Registrar’s Division of HIP 
keeps these cards up-to-date by adding new members of the family, 
removing members who reach the age of eighteen, re-enrolling them if 
they elect to continue in HIP, and otherwise changing the records as 
needed. Addressograph plates are made which contain all of ihe essen- 
tial identifying data and, from these, imprints are sent out to the medi- 
cal groups and given the subscriber on his identification card. IBM 
cards are made for each person covered and are kept up-to-date as 
enrollment changes occur. These cards and their several reproductions 
serve both statistical and accounting needs. They make readily avail- 
able at all times the demographic data on the covered population and 
show the distribution of that population among the medical groups. 
Twice a year the cards on current population are tabulated for size of 
families and for the age and sex of enrollees as a whole and in each 
medical group. The records are kept in such a way that effective dates 
as well as those of terminations, renewals, and conversions can be estab- 
lished for studies of past periods. 

The data on utilization consist of a recording of each contact of a 
patient with a physician or of a visit by a nurse. Services performed by 
technicians, such as the optometrist, the physiotherapist, and those in 
the laboratory, are summarized monthly. The physician’s report on 
each person seen carries a full identification of him by means of a family 
certificate number and an individual code number which reveals his sex 
and family status as well as the month and year of his birth. This record 
also shows the employer group to which the patient belongs. These 
items can be easily copied on the physician’s report from the enrollee’s 





1 Reproductions of the basic forms and the original illustrative material used in this paper will be 
sent on request. Address, The Health Insurance Plan of Greater New York, Division of Research and 
Statistics, 7 East 12th Street, New York 3, New York. 
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identification card or from the cards furnished the medical groups. 
There are additional items in this report including the date of service, 
the physician who performed it, his medical specialty, the place at 
which the patient was seen, the type of service rendered, that is, pre- 
ventive, surgical (major and minor operations and deliveries), or medi- 
cal service of a treatment character, and an indication of the diagnosis. 

At the end of the month the medical group returns to the HIP Di- 
vision of Research and Statistics these recordings of physicians’ serv- 
ices. There they are carefully checked, first to see that every reporting 
physician is accounted for, and then for the completeness and internal 
consistency of his reports. A current monthly report is prepared on the 
over-all utilization of the enrollees of each medical group and of HIP 
as a whole from hand tabulations summarizing the returns of each 
physician. An IBM card is then punched for each service rendered. 
From March 1, 1947 when HIP opened service with 2,543 enrollees, to 
December 31 of that year when the number of persons had advanced 
to 76,000, a total of 160,000 services was reported. During 1948 when 
the enrollment rose to 120,000, reports on a total of 500,000 services 
were filed. In 1949 the enrollment which at the year’s end stood at 
222,000, received a total of 850,000 services. In each of these years the 
figures are exclusive of radiologists’ and pathologists’ services. Since 
these specialists have no patients receiving services solely from them, 
the omission of such ancillary services, estimated at approximately ten 
per cent of all other physician services, does not entail the omission of 
any patients. Since January 1, 1950 the radiologists have been report- 
ing services in the same detail as other physicians. 

Plans for the analysis of these HIP materials call for the assembly 
of the services rendered each patient and each family each year in the 
way that has been indicated on the report for the Rankand Fyle family. 
Thus far, HIP has been only partially able to put together the ma- 
terials for 1947. It is expected that each person and each family will 
have an annual summary card. 

It has been possible to analyze the services rendered in 1948 to per- 
sons of each age group and sex? without a person-by-person assembly 
of the service records because, as said above, every service report car- 
ries the coding of the recipient’s age and sex as part of his identification. 
Age-sex-specific utilization rates were computed from the total services 
for an age-sex group and the enrollee years estimated for that group. 





2 Neva R. Deardorff, “The 1948 Experience of the Health Insurance Plan of Greater New York 
with the Utilization of Physician Services by the Enrollees in Each Age-Sex Group,” American Journal 
of Public Health, Vol. 40, No. 12 (December, 1950). 
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REPRESENTATIVE CHARACTER OF POPULATION AT 
RISK AT TIME OF ENROLLMENT 


Looked at from the broad community point of view, the first ques- 
tion that arises relates to the representative character of these ma- 
terials. Can the people enrolled in a voluntary plan for prepaid medical 
care be accepted as representative of the general population or of any 
well-defined segment of it? What factors, if any, have operated to select 
enrollees of such a plan? 

These questions are more easily posed than answered. 

In the case of HIP the underwriting rules under which it now oper- 
ates, require group enrollment with an initial minimum of twenty-five 
persons when at least seventy-five per cent of the eligible persons in an 
employed group elect to enroll. It will take as few as ten when 100 per 
cent of the group enroll. In employed groups of eleven to twenty-four 
employees at least ninety per cent must accept the Plan. 

In assessing the representative quality of a body of enrollees, many 
attributes can be chosen as tests. Gross factors of selection, such as 
age and sex, can be seen and correctives can be applied with relative 
ease. But other tests are both less susceptible of application and less 
clearly established as determinants of health conditions. The factor of 
occupation immediately comes to mind. Perhaps the first point to note 
in the case of HIP is that its present population enjoys a high degree 
of stability of employment. It has almost no casual labor or heads of 
families who are unemployed. Its enrollment consists of employees of 
the City of New York—lawyers, street repair men, members of college 
and school faculties, all types of engineers and inspectors, subway and 
bus operators, accountants and statisticians, firemen, maintenance 
crews, sanitation workers, nurses, park and zoo attendants, and clerical 
workers of all types, and the eligible dependents of all of these employ- 
ees. The enrollment also includes employees of the United Nations and 
their families. A score of social agencies, about 150 small businesses, 
the welfare funds of five labor unions and the tenants of a veterans’ 
housing project with 800 families also have contracts. A great many 
occupations are represented, but not those in such important industries 
in New York City as the garment trades, heavy construction and print- 
ing, except possibly as the spouses of insured persons may be employed 
in them. The recent acceptance of the tenants of housing projects may 
serve to spread the occupational representation. Naturally, mining and 
agricultural workers are not on the HIP rolls. 

But one may question the necessity for a precise cross-section of the 
occupations within a given community if good distributions of impor- 
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tant concomitants of occupation can be realized. While death and dis- 
ability rates for occupational groups strongly suggest an association 
between health status and occupation, why this should exist is not so 
clear. At least three elements enter into it: the specific disease or acci- 
dent hazards involved in the particular work performed, the educa- 
tional status of the persons thus employed, and the income attaching to 
the given occupation. With respect to the latter two, the occupation 
would appear to be merely a symbol rather than a factor of direct im- 
port. The first factor applies only to the worker, whereas the other two 
spread over to his family. It is quite possible that if the income-educa- 
tion status of enrollees in prepaid medical care plans could be secured 
directly, that information would be sufficient for an appraisal of the 
characteristic nature of that enrollment. Certainly the facts on educa- 
tion and income, as they are being established through the United 
States Census, are more manageable, statistically, than are the data on 
the myriad of occupational titles held by the people of a metropolitan 
center as large as New York City. A well-devised sample of HIP en- 
rollment could yield the necessary data on income and education with- 
out undue expense. 

Biases arising from the over- and under-representation in an enroll- 
ment of the persons employed in industries and occupations with spe- 
cific occupational diseases and hazards, can be assessed only if there is 
precise knowledge about their presence both in the total community 
and in the enrolled group a8 well as about the disease control factors 
operating upon the workers in the two groups. It is entirely possible 
that the persons engaged in hazardous work processes, whose em- 
ployers provide prepaid medical care, are a selected group in the sense 
that their employers are sensitive to health hazards and give them 
special protection on the job. This is a complex, difficult, and technical 
subject to which general morbidity data may have only minor contri- 
butions to make. These matters clearly deserve further objective study. 
In any case, efforts should be made to establish the character of the 
enrollment in respect of education and income. The data on the mor- 
bidity experience under the Plan would thereby be carried a step nearer 
to general community indexes. 

Are there more subtle psychological factors operating within the 
groups to whom prepayment plans have been made available, that 
single out as the seventy-five per cent who enroll or the twenty-five 
per cent who may not do so, persons and families with distinctive cul- 
tural and psychological characteristics related to health? Are the people 
who enroll apt to be those who are especially health conscious? If so, 
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it is conceivable that they enter the Plan with less medical neglect than 
is prevalent among persons generally with the same education and eco- 
nomic power. On the other hand, are those who enter especially anxious 
to attend to conditions which they have hitherto been forced to neglect? 
Are the people who stay out those who feel no need for medical care or 
those already under such care and unwilling to change their doctors? 
Instances of all such types are noted from time to time, but there is no 
way of establishing their significance. The plan of group enrollment 
doubtless narrows the operation of such factors, though it probably 
does not completely eliminate them. Moreover, the tendencies and 
biases mentioned above may all be present and in varying degrees can- 
cel each other. 

After coming into the Plan, the enrollees’ behavior in availing them- 
selves of preventive services and thus coming under observation, is far 
from uniform. The utilization rates of the several labor unions in HIP 
illustrate these differences. In its first year one union had a very low 
utilization rate, although it was known to be medically needy. It later 
increased its rate. Another union had a rate in its first year well above 
that for the whole body of enrollees. The latter organization carried on 
a vigorous campaign to get its members to have health check-ups early 
in their coverage. By thus bringing their members under medical ob- 
servation, many conditions were brought to light that would otherwise 
not have been reported or at least not so promptly. Under one of its 
private contracts the HIP has an ultra-sophisticated group as regards 
physical and mental conditions—a staff of social workers, almost all of 
whom have been psychoanalyzed. One can be pretty sure that they will 
have periodic health examinations. At the other end of the scale are 
groups of unskilled laborers who, though having the first opportunity 
of their lives at high-grade doctoring outside a charitable clinic, will 
not uniformly rush to come and get it. 

Only the most elaborate studies, could, we believe, fully appraise 
the representative quality, as regards the conditions of health, of medi- 
cal neglect on admission to the Plan and of attitude toward preventive 
services of the people who have elected to come into it. We can only 
say that no evidence has come to light as yet which establishes the 
HIP enrollment collectively as peculiar or different from their fellow 
New Yorkers of corresponding ages and sex in these several respects. 


REPRESENTATIVE CHARACTER OF POPULATION AT 
RISK AFTER INITIAL PERIOD OF ENROLLMENT 


The continuance of the characteristic quality of the health experi- 
ence of this population after an initial period of enrollment is an en- 
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tirely different story. If HIP succeeds in doing what it is intended to 
do, illness will be brought under greater control and thereby reduced 
among its enrollees. In other words, the HIP program is specifically 
aimed to turn its enrollment into a selected group. The experience with 
enrollees during their first year may prove to have value as an index of 
the needs for health service among people living at the various eco- 
nomic levels. But thereafter, the HIP’s program is pointed toward 
demonstrating what can be done to raise the health status of people 
living at all economic levels and toward showing what an equalized 
opportunity for the best medical care can accomplish for them. It is in 
the latter regard that the major contribution of HIP will doubtless be 
made. In time the amount of illness should become manifest as it ap- 
pears among people, whatever their income, who have complete access, 
on a non-charitable basis, to full medical care rendered by physicians 
motivated to prevent illness—a possibility which sharply distinguishes 
HIP from medical indemnity plans. As has been said, it should be 
possible to measure medical neglect by analyzing the conditions found 
among persons during the early years of their enrollment. The experi- 
ence of the Rankand Fyle family described earlier, illustrates both of 
these possibilities. 

As for measurement of the extent to which illness is prevented after 
enrollment, the problem is at least as difficult as the establishment of 
the representativeness of the enrollees on admission. Ideally, this is a 
situation in which control group technique should be employed. But no 
plan for prepaid medical care is in a position to establish a properly 
chosen control group and thereafter to keep it under observation for a 
period of years. Neither, so far as can be seen, is any other health or- 
ganization, public or voluntary, in a position to do so. But there are 
some possibilities for comparison of certain types of experience. HIP 
has itself compared, for maternal, fetal, and neonatal mortality, and 
for mode of delivery, the outcome of 1,015 obstetrical cases delivered 
during the year ended June 30, 1949, with the experience of New York 
City as a whole. 

It has been suggested that the relative amount of hospitalization 
could reflect the effects of early diagnosis and treatment. It would, of 
course, be necessary to establish rates of hospitalization specific for age 
and sex and to make sure that all hospitalization in the group with pre- 
paid care and in the community generally were taken into account. It 
would also be necessary to distinguish between first and subsequent 
years of enrollment and to give attention to cultural factors and admit- 
ting policies of both hospitals and referring physicians. 
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The mortality experience may have possibilities. It certainly cannot 
be ignored. Again the comparison would have to be established for age- 
sex groups. If it were possible to ally comprehensive prepaid medical 
care with life insurance for a group of policy holders, perhaps a study 
could be devised that would yield results that could be scientifically 
validated. It might then be possible to measure with considerable pre- 
cision the effects on life expectancy of a system of long term, compre- 
hensive prepaid medical care that kept fully abreast of the advancing 
frontiers of medicine. 

Returning from this digression as to measurement of the differences 
in health between people enjoying the benefits of a system of prepaid 
medical care and those not so covered, it is necessary only to point 
again to the improbability of the experience of a population under con- 
tinuous care being an accurate reflection of the morbidity conditions 
in the total population at any given point in time. 


ANALYSIS OF MORBIDITY DATA ON PERSONS AND FAMILIES 


It remains to explore the ways in which these materials on the health 
of enrollees can be put together to indicate the extent and type of illness 
among them. Of the many pertinent questions on statistical objective 
and technique that arise under these circumstances, two have been 
chosen for discussion here as of primary significance. These are: 

1. What unit is best adapted to the analysis of morbidity as it comes 
under observation in prepaid medical care of the comprehensive 
service type? 

2. How can the available morbidity data on families be managed? 
The raw material out of which the statistical analyses must be made 
can be clearly seen in the Rankand Fyle case with which the discussion 
began. But the statistical treatment of the material remains to be de- 
vised. 

UNITS OF ANALYSIS OF MORBIDITY DATA ON PERSONS 


The casting of the morbidity materials into data which are com- 
parable from year to year and from group to group, requires immedi- 
ately the selection of the unit to be counted. Is that unit an episode of 
illness or injury or is some other unit better adapted to the materials? 
It seems clear that to translate all of these data into episodes intro- 
duces artificiality in many instances. To the extent that disease which 
might become serious is not permitted to do so, its sharp episodic char- 
acter is flattened out. The reported conditions no longer take the shape 
of well-defined unitary happenings that can be separately spotted, de- 
scribed, measured, and counted. Instead of a limited number of pro- 
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nounced attacks, we have a vast number of tiny skirmishes with dis. 
ease. This creates the necessity for devising other ways of describing, 
statistically, the conditions of health that characterize the population 
at risk. The Statistical Division at HIP has not reached a final decision 
on the des:zriptive unit or units, but it is moving in the direction of 
setting up as the unit to be counted a person with a given diagnosis in a 
year. Recently, a trial analysis was made of a small number of cases 
with these questions and problems in mind. For this purpose a straight 
run of 93 families from the files of one medical group, augmented by 
seven families known to have had a baby delivered during that period 
(a total of 395 persons) was used. All of these families had children. 
They had the same physicians for the entire year of 1948. 

The analysis appearing in Table 1 is intended solely to explore the 
problems of extracting diagnostic data from the visit-by-visit reports 
made by physicians as they saw patients and of relating it in conven- 
tional ways to age and sex. Instead of episodes or illnesses, a year of 
observed experience for the person has been the unit characterized and 
counted. Each person is counted but once. This method limits know|- 
edge of the incidence of specific disease entities, especially the minor 
ones, since a choice of one condition has to be made and, naturally, the 
one most significant for basic health was chosen. But with the virtual 
disappearance of many acute communicable diseases as well as other 
serious acute infectious conditions, and the increase in the visibility, at 
least, of chronic conditions, incidence as such becomes a less significant 
measure of total disease than prevalence. Moreover, the incidence of 
all reportable diseases is established by other means. In about one-half 
of the cases under medical observation used in the trial run, the doctor 
reported only one condition. In the other half of the cases other diag- 
noses were reported either concurrently or at different times during the 
year. In this connection it is well to recall that specialist care in the 
medical groups is easily available and that people often ask for special- 
ist service and are referred for conditions of the eyes, of the skin, and 
for minor postural and other defects which might be passed over in 
ordinary fee-for-service medical practice. The fact that the patient had 
other diagnoses was indicated in the table by the letter “W” meaning 
“with other conditions.” With regard to multiple diagnoses the nota- 
tions on these cases present the usual problems and are susceptible of 
the same treatment as are the reports on illnesses collected in any other 
way. It would be possible to code and count all persons with each con- 
dition reported if that were thought desirable. The unit then would be a 
person-diagnosis-year. 
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In general the situation can be summarized by saying that compre- 
hensive prepaid medical care combined with the group practice of 
medicine offers unusual opportunities for bringing people under very 
close medical observation for minor as well as for serious conditions; 
however, it is better adapted to the reporting of prevalence than of 
incidence since recurring incidents of the same disease are not very 
clearly delineated. While it is not likely that every minor condition with 
which every patient was beset is reported, it is probable that the omis- 
sions are minimal since group practice tends to bring patients under the 
eyes of more than one physician, thus offsetting the tendency of a given 
physician to report only the conditions with which he is especially con- 
cerned. 


DATA ON FAMILY HEALTH CONDITIONS 


Very little statistical effort has gone into the description of family 
health conditions and of changes in them from year to year, probably 
because hitherto there have been few, if any, systems of medical care 
that combined current reporting with observation of families as such. 
There is, however, a growing necessity for more serious consideration 
of the problem. The interrelation of the health conditions of family 
members is a subject of increasing medical interest. Certainly medical 
care and the meeting of its costs are family obligations. But the family 
is a difficult statistical unit with which to deal descriptively and iongi- 
tudinally. Groups of families vary widely in many characteristics and 
a single family may change in size, composition, and several other re- 
spects within a relatively short time. Moreover, problems of health and 
disease abound in detail and complexity. To bring all of these concepts 
into good working relationships will require careful analysis, planning, 
and experimentation. 

As the first move in the analysis of the HIP family data a rough over- 
all classification of family health conditions for a year, was set up. It 
has twenty groupings which progress from no apparent need for medical 
care to serious conditions of more than one kind. 

This classification entails a definition of serious and of minor condi- 
tions. A serious condition is conceived to be one that may be caused by 
disease, trauma, or physical defect which immediately jeopardizes life 
or substantially reduces full working capacity. A minor one, though not 
carrying these extreme dangers, causes discomfort, temporary disfigure- 
ment, weakens resistance and/or vitality, or otherwise depreciates a 
person’s health. While the diagnosis itself will usually indicate the in- 
stances of serious illness, it will not always do so, as in the cases of con- 
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trollable chronic conditions. Attention must, therefore, be paid to ac- 
companying conditions: hospitalizations, operations, home visits, and 
the total volume of care reported for the year. The element of judg- 
ment required in coding these determinations could be reduced through 
the formulation of rules on these matters. The status for the year of 
given family is determined by the most serious condition suffered by 
any of its members. 


FAMILY HEALTH STATUS CLASSIFICATION 
(As used in Illustrative Table 2) 


First class No medical services reported 

Second class Preventive service only 

Third class Minor acute conditions and/or minor accidents only 

Fourth class Minor chronic conditions and/or other minor conditions only 

Fifth class Obstetrical and/or pediatric care only 

Sixth class Obstetrical and/or pediatric care with any or all types of minor conditions 
Seventh class Obstetrical and/or pediatric care with any or all types of serious conditions 
Eighth class Correction of serious physical defects, with or without minor conditions 

Ninth class Serious acute conditions with or without minor conditions 

Tenth class Serious chronic conditions with or without minor conditions 

Eleventh class Serious traumatic conditions with or without minor conditions 

Twelfth class Serious mental conditions with or without minor conditions 

Thirteenth class Serious acute and serious chronic conditions with or without minor conditions 
Fourteenth class Serious acute and serious traumatic conditions with or without minor conditions 
Fifteenth class Serious acute and serious mental conditions with or without minor conditions 
Sixteenth class Serious chronic and serious traumatic conditions with or without minor conditions 
Seventeenth class Serious chronic and serious mental conditions with or without minor conditions 
Eighteenth class Serious traumatic and serious meatal conditions with or without minor conditions 
Nineteenth class Serious conditions in other combinations 

Twentieth class | Conditions not elsewhere classified 


The cases with obstetrical conditions were set apart because they 
carry a distinctive and substantial need for services both for the mother 
and for the baby, if it is born within the year reported. Much of that 
care is health supervision and should be distinguished from services to 
the sick. 

While all of the families used for this trial analysis did not imme- 
diately and automatically classify themselves into the twenty groups, 
there was comparatively little difficulty in their assignment to one or 
another of the classes. A total of thirteen of the rubrics were required 
for this small block of cases. The Rankand Fyles were not included 
among them, but if they had been, they would have been classed in 
group 8 for the first year and group 4 in the two succeeding years. Over 
a period of years the classifications for a family would reflect its shifts 
in health status. 

Having classified the families in terms of health status as here de- 
fined, a few tabulations of related data were made to see how the classi- 
fication would stand up. But before commenting on those results, the 
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extent to which the family members came under medical observation 
during the year should be noted. Of the 100 families, in 47 instances all 
of the eligible family members had come under such observation and in 
79 instances all but one member were seen. Four of the families did not 
come for any medical care during the year. While we can no more claim 
characteristic quality for this block of 100 families on this point than 
on any others, it is necessary to make clear that HIP deliberately pro- 
motes the ideal of preventive medicine and family care throughout its 
entire enrollment and hopes eventually to educate its enrollees to have 
at least an annual health check-up for each member of the family. 
When that is in effect, it certainly should be possible to classify families 
according to health status, insofar as that depends upon their having 
had medical review. 

The data set forth in Table 2 are intended in part to augment the 
information about the 100 families (Items I and II), but primarily to 
relate the basic classification to several factors of medical care (Items 
IV, V, VI, and VII) in order vo see whether these would support the 
original classification or cast doubt upon it. As was said above, in some 
instances these items were themselves factors in determining the class 
as when a major operation of a family member led to the family’s as- 
signment to a class that had had a serious condition, or when a repeti- 
tion of visits helped to indicate the chronicity of the condition reported. 
In these cases discrepancies have been deliberately avoided. 

Looking at the other data for this obviously non-random sample, 
nothing is disclosed that is inconsistent with the suggested classifica- 
tion. The patterns of medical service show a relationship to the medical 
need implied in the definitions of the several classes and to the limita- 
tions under which the care is rendered. As was pointed out earlier, the 
HIP contract does not cover medical care in an institution other than 
a general hospital (that is, the HIP physicians do not, in fact, cannot 
follow patients into public institutions or voluntary institutions that 
are not general hospitals) ; it does not cover Workmen’s Compensation, 
Veterans’ Administration cases, or treatment of mental diseases, though 
the enrollee may receive diagnostic services which help to establish his 
need and eligibility for these other types of care. That means that some 
very serious illnesses will receive comparatively few services, but insofar 
as they are seen and diagnosed by the HIP physician, the family can 
be classified. Utilization rates for persons in families with some other 
types of serious conditions are also disturbed by the dispersal of family 
members and the disintegration of the family which sometimes results 
from the condition itself. The effects of serious illness on families of 
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various types need a great deal more attention than they have received 
to date. While the data from prepaid medical care cannot tell the whole 
story, they supply an excellent means of case finding and some back- 
ground for such study. 

Because families themselves are units which present difficulties in 
the establishment of measurements, it seems likely that for many 
analyses of family health conditions, the units will be persons in families 
with given characteristics. In Table 2 there are several illustrations of 
such analyses. 

While on the whole the concept of a classification of a family’s health 
experience has not been challenged by these results, it is abundantly 
clear that this attempt is rudimentary and leaves much to be debated 
and worked over. 

The first sentence of this paper, written in the autumn of 1950, uses 
the word “potentialities” to warn the reader against expecting a presen- 
tation of morbidity data already compiled from this comparatively new 
source. At that time only a few soundings could be offered. A footnote 
may now be added. The potentialities of the HIP data accumulated 
over a period of four years are now being exhaustively tested. Substan- 
tial grants for this purpose were made by the Commonwealth Fund and 
the Rockefeller Foundation. The Milbank Memorial Fund is aiding 
generously with consultation service. A distinguished group of biome- 


tricians, with Dr. Lowell J. Reed as Chairman, serves as a Steering 
Committee. A final report is due by July 1954. 





SOME CASES IN WHICH YATES’ CORRECTION 
SHOULD NOT BE APPLIED 


Epwin L. Crow 
U. 8. Naval Ordnance Test Station, Inyokern 


RANZ ADLER has recently discussed the lack of uniform and explicit 

rules for the application of Yates’ correction for continuity in 2X2 
tables.'! One question raised is easily answered explicitly, and has in fact 
essentially been answered by Paulson and Wallis.? The applicability of 
the correction is questioned in cases in which it affects the algebraic 
sign of the deviation of the observed frequency from the expected fre- 
quency (in any one of the four cells of the table). 

In these cases, in other words, the observed frequency differs from 
the expected frequency by not more than $. But this is as close to the 
expected frequency as any observed frequency, being an integer, could 
come; no closer agreement with the null hypothesis of independence 
(or homogeneity) is possible. Hence Yates’ correction should not be 
applied in such cases, and it follows immediately, with no further 
calculation, that the observed frequencies are not inconsistent with the 
null hypothesis at any level of significance. 

The x? test may be performed without reference to expected fre- 
yuencies using the formula cited by Adler: 


ot (| AD — BC| — N/2)2N 
x "(A + B\(C + D)(A + C)(B + D) 





where A, B, C, D are the observed cell frequencies and N is the total 
frequency. Relative to this formula the cases considered above are 
those in which |AD—BC| $N/2. Consequently, when this inequality 
is true, it follows immediately that the observed frequencies are not 
inconsistent with the null hypothesis, at any level of significance, with 
no further calculation. 

In the foregoing it is tacitly assumed, as is often done with the x? 
test, that one is testing against a two-sided alternative. If one is testing 
against a one-sided alternative, the null hypothesis corresponds to in- 
dependence or association in one direction, positive say, while the 





1 Franz Adler, “Yates’ Correction and the Statisticians,” Journal of the American Statistical Associa- 
tion, Vol. 46 (1951), pp. 490-501. 
- 2 Edward Paulson and W. Allen Wallis, “Planning and Analysing Experiments for Comparing Two 
Percentages,” Chapter 7 in Statistical Research Group, Columbia University, Techniques of Statistical 
Analysis (New York: McGraw-Hill Book Company, Inc., 1947), pp. 247-265. 
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alternative corresponds to negative association. If the observed fre- 
quency differs from the expected in the direction of positive association, 
there is complete agreement between the data and the null hypothesis, 
If the observed frequency differs by not more than 3 from the expected 
frequency in the direction of negative association, then the agreement 
with the subclass of the null hypothesis corresponding to independence 
is as close as possible. Hence if one is testing against a one-sided alterna- 
tive and the observed and expected frequencies differ by not more than 
4, it can likewise immediately be concluded that the data are not in- 
consistent with the null hypothesis, at any level of significance. 





PROCEEDINGS 


AMERICAN STATISTICAL ASSOCIATION 
111TH ANNUAL MEETING 


HOTEL COPLEY-PLAZA, BOSTON, MASSACHUSETTS 
DECEMBER 27, 1951 


MINUTES OF THE ANNUAL BUSINESS MEETING 


The meeting was opened with Past President Samuel S. Wilks in the chair. 
Hugo Vivo, representative of the Cuban Chapter of the Association, was intro- 
duced to the membership. 

Lowell J. Reed, retiring President of the Association, read his presidential 
address (which was published in full in the March 1952 issue of this Journal). 

After delivering his presidential address Dr. Reed assumed the chair and called 
upon Aryness Joy Wickens to report for the Committee on Elections. Mrs. 
Wickens reported that the following officers had been elected: 


President: Aryness Joy Wickens; President-Elect: William G. Cochran; Past- 
President: Lowell J. Reed; Vice Presidents: Morris H. Hansen, Walter E. Hoad- 
ley, Jr., Herbert Marshall; Directors: Gertrude M. Cox, W. Edwards Deming, 
Howard L. Jones, William R. Leonard, John W. Tukey, Raiph J. Watkins; 
Secretary-Treasurer: Samuel Weiss; Members of the Council: Waite S. Brush, 
Jerome Cornfield, Lucile Derrick, A. Ross Eckler, William W. K. Freeman, 
William Haenzel, Werner Hochwald, Paul G. Hoel, John W. Hopkins, Palmer 
O. Johnson, Howard Levene, Rensis Likert, Frederick Mosteller, Orville B. 
Bailey, John R. Stockton, W. Allen Wallis, Sylvia Weyl. 


W. G. Cochran, representing the Committee on Fellows, announced the elec- 
tion to Fellowship by his Committee of the following members of the Association: 


Richard Loree Anderson David John Finney Bruce D. Mudgctt 
William Forrest Cal- Harold Adolph Freeman  K. R. Nair 

lander Emil Julius Gumbel Henry Scheffé 
Jerome Cornfield Paul Gerhard Hoel Henry Soladay Shryock 
Besse Beulah Day Vladimir Stepanovitch Gerhard Tintner 
Daniel Bertrand DeLury Kolesnikeff Jacob Wolfowitz 
Paul Sumner Dwyer Margaret Merrell William John Youden 


(The detailed report of the Committee on Fellows was published in the February 
1952 issue of The American Statistician.) 

_ Morris H. Hansen read the report of the Board of Directors for 1951 which is 
published in this issue of the Journal in full. 

The Secretary-Treasurer, Samuel Weiss, read the report of the Secretary- 
Treasurer which is printed separately in this issue of the Journal. 

The reports of the Committee on Elections, the Committee on Fellows, the 
Board of Directors, and the Secretary-Treasurer were voted accepted by the 
membership. . 

Lowell J. Reed reported that as a result of continued absence from the United 
States, Philip M. Hauser had resigned from his position on the Board of Directors 
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of the Association and that Herbert Marshall, Dominion Statistician of Canada 
had been appointed to replace Hauser for the term expiring at the end of 1952. _ 

Upon a motion from the floor, the membership of the Association voted to 
express its appreciation of the arrangements for the Annual Meeting made by the 
local Arrangements Committee of the Boston Chapter and to express in particu- 
lar to Wendell MacDonald, Chairman of the local Arrangements Committee, 
and to Harold Dorn, Chairman of the Annual Meeting Program Committee, the 
membership’s appreciation of their efforts. 

Lowell J. Reed turned the chair over to Aryness Joy Wickens, the incoming 
President of the Association, who spoke briefly to express her honor at her elec- 
tion. 

Benedict Saurino, Chairman of the Committee on Institutional Membership, 
reported on the activities of his Committee during 1951. Alfred Watson, Chair- 
man of the 1952 Annual Meeting Program Committee, gave a progress report on 
the activities of his Committee. (Committee reports will be published in full in 
The American Statistician during the course of 1952.) 

Mrs. Wickens presented to the membership a proposal by the Board of Di- 
rectors that the 1954 Annual Meeting of the Association be held in Canada during 
the Summer. After discussion from the floor, it was voted that the Board of Di- 
rectors be instructed to meet in Canada in 1954 and that it experiment with a 
change of time for the 1954 meeting. 

A rising vote of thanks was given to Lowell J. Reed for his conduct of the affairs 
of the Association during 1951. The meeting was adjourned. 


Report of the Board of Directors for 1951 


The calendar year just ended has been marked by consolidation and growth 
of the effectiveness of the Association; by establishment of closer and more 
flexible cooperation between the Board and the Secretary’s Office; by the de- 
velopment of new activities by sections and committees, and by the launching of 
a variety of new projects. 


Committees 


A major step in the organizational field was the appointment of a Liaison 
Committee by the Board of Directors to deal with the Secretariat, to acquaint 
the Board more closely with operational problems and to serve, in a sense, as 
both an Executive and a Finance Committee. Established toward the beginning 
of 1951, the Liaison Committee has met regularly with Secretary Samuel Weiss 
and the Assistant to the Secretary, Sylvia C. Weyl. Through this organization, a 
great deal of work has been accomplished informally without the necessity of 
polling the Board as a whole on minor policy matters. The members of the 
Liaison Committee were Lowell J. Reed, President of the Association, Aryness 
Joy Wickens, President-Elect, Dorothy 8. Brady, Morris H. Hansen, and Ralph 
J. Watkins. 


Program Committee 


The Program Committee for the 1951 Annual Meeting organized an unusually 
successful meeting with Statistics and the Development of Man as its guiding topic. 
Joint sessions were held with fourteen other professional associations, represent- 
ing economists, mathematicians, financial analysts, engineers, public health ex- 
perts, population experts, sociologists, actuaries, biologists, physicists, and others. 
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The chairman of the Program Committee was Harold Dorn of the National Insti- 
tute of Public Health. 


Other Committees 

The Committee to Advise the Bureau of Labor Statistics on its Price Indexes, 
under the chairmanship of Bruce Mudgett, has been working closely with the 
Bureau of Labor Statistics’ staff. During 1951, the Committee advised on re- 
vision of the wholesale price index which is almost completed and on revision of 
the consumers’ price index. 

Under the chairmanship of Benedict Saurino of the Sun Oil Company, the 
Committee on Institutional Memberships is conducting an energetic and suc- 
cessful campaign which has already raised total institutional membership from 
13 to 24. It is gratifying to note that all old institutional members have renewed. 
About 200 letters of invitation were sent out on November 1 to business and 
industrial firms which have a substantial interest in statistical matters and it is 
expected that further favorable responses will be forthcoming during the next 
few months. 

The Association’s Committee to Advise the National Research Council Com- 
mittee for Research in Problems of Sex, under the chairmanship of William G. 
Cochran, continued its study of the statistical problems of the American Insti- 
tute of Sex Research, Inc. and has made a detailed report to the N R C Commit- 
tee and the American Institute (Dr. Alfred C. Kinsey and co-workers). 

During 1951, the Association has published all Committee reports in The 
American Statistician. 


New Committees 

The Association has recently organized an Advisory Committee on Statistical 
Policy to the Division of Statistical Standards, Bureau of the Budget, at the re- 
quest of that Governmental Agency. Composed of Past Presidents of the Associ- 
ation, the Advisory Committee is chaired by E. A. Goldenweiser. The first 
meeting, held on October 9, was devoted chiefly to the problem of the confidential 
status of individual returns to Federal statistical agencies and to the view of the 
Hoover Commission experts that confidentiality may result in duplication of 
effort and greater costs to the taxpayer. In consultation with representatives 
from five governmental agencies, the Advisory Board will consider other basic 
policy problems in the field of governmental statistics. 

In view of the increasing role of statistics in the physical sciences, the Associa- 
tion has organized a Committee on Statistics in the Physical Sciences under the 
chairmanship of John W. Tukey. Operating through various specialized sub- 
committees and working groups, the Committee will plan joint sessions and 
special programs with the various professional associations concerned and will 
work toward better mutual understanding of the role of statistics in the natural 
sciences. 


Sections 

This is the first year that the three sections of the Association—Biometrics, 
Business and Economic Statistics, Training of Statisticians—have been formally 
organized and operating. These three sections, together with the Committee on 
Statistics in the Social Sciences and the Committee on Statistics in the Physical 
Sciences, have borne the brunt of responsibility for organizing the Annual Meet- 
ing program. 











308 AMERICAN STATISTICAL ASSOCIATION JOURNAL, JUNE 1952 


The Section on the Training of Statisticians has been cooperating closely with 
the Committee on the Mathematical Training of Social Scientists and is at pres- 
ent working on a program of Seminars to be attended by university and college 
teachers of statistics. In addition, it is sponsoring the preparation of an article 
to describe the training requirements for a statistical career. The Association 
plans to use this material in advising vocational centers and prospective students 
in the field of statistics. 

The Section on Business and Economic Statistics, under the chairmanship of 
Walter E. Hoadley, Jr., in addition to planning a number of sessions at the 
Annual Meeting, set itself the responsibility of organizing an operating pro- 
gram in the field of business and economic statistics. Early in 1951 when analy- 
sis revealed that almost half of the Association’s members were concerned with 
business and economic statistics, the Section Committee determined that it 
should— 

1. Study the basic causes for certain cleavages among members (e.g. academic 
versus non-academic, mathematical versus non-mathematical), 

2. Make greater effort to obtain manuscripts dealing with significant business 
and economic statistics problems, for publication by the American Statistical 
Association, 

3. Attempt to aid the Association in raising additional funds from the business 
community to support a wider publication program, and 

4. Explore the possibility of holding certain meetings at times other than dur- 
ing Christmas week. 


Survey of the Statistical Operations of the Bureau of Mines 


The Association has entered into a contract with the Bureau of Mines of the 
United States Department of the Interior under which a project has been estab- 
lished to survey the statistical operations of the Bureau. An Advisory Committee 
consisting of Raymond T. Bowman, as Chairman, with Clarence D. Long, Jr., and 
Ralph J. Watkins as members, has been advising the Bureau and the Director of the 
project. The survey itself is being made under the direction of J. E. Morton and 
a small professional staff. Work started in July 1951 and it is expected that a 
final report will be completed and turned over to the Bureau at the end of July, 
1952. This report will analyze the statistical operations of the Bureau and ap- 
praise their quality, efficiency, and adequacy. The chief purpose of the report 
will be to present a series of recommendations which the Bureau of Mines may 
apply in improving the quality of this very important part of the statistical pro- 
gram of the United States Government. 


Report of the Secretary-Treasurer for 1961 . 


During 1951, the Secretary’s Office devoted a large part of its efforts to main- 
taining extremely careful financial management and, as a result, completed its 
second successive year in the black. This improved financial situation, it should 
be emphasized, was due, not to membership growth, but to rigorous economies. 
The actual net surplus for 1951 was approximately $7,000.00. Accordingly, the 
Association begins the year 1952 with a surplus of more than $10,000.00. 

Expenses during 1951 were about $44,500.00 after putting sales items on a net 
basis. The budget had not made adequate allowance for rising costs and thus the 
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extent of actual retrenchment is masked. Total income was forecast at about 
$46,000.00, but was actually approximately $51,500.00. Of this increase, about 
$2,000.00 was from sales of publications, $1,100.00 from new Institutional Mem- 
bers resulting from the intensive work of the Institutional Membership Com- 
mittee under the Chairmanship of Benedict Saurino, $900.00 from Journal 
subscriptions and $600.00 from additional advertising revenue from the Member- 
ship Directory. 

Membership in the Association has remained approximately stationary for the 
past few years. In view of the growing importance of statistics in science, industry 
and government, the Board has decided to launch a vigorous drive for new 
members. Two thousand dollars is allocated for this in the 1952 budget, the 
target being 1,000 new members during the twelve-month period. This appropria- 
tion has been made possible by a retrenchment policy which in three years 
brought the Association out of an unsatisfactory financial situation into one of 
stability and strength. This new drive for membership should be of substantial 
aid in building up a greater surplus, in line with former President Samuel Wilks’ 
proposals for the future development of the Association. 

During 1951, we published a Membership Directory for the first time in five 
years, which was distributed gratis to all Association members. Unlike its prede- 
cessors, the Directory lists membership, not only alphabetically and by geo- 
graphical area, but by section as well. The Directory is being sold to Journal 
subscribers who are not members and to the public at large for $2.50 a copy. 
It is hoped that this will reduce the net cost of this venture to the Association. 

Upon the request of the International Statistical Institute, Sylvia Weyl of my 
office has prepared a plan for the financing, editing, and publication of an 
abstracting journal by the ISI. All of the national member organizations, the 
American Statistical Association among them, are to participate in this project, 
if finally approved, by providing editorial staff to cover their national publica- 
tions and by assuming responsibility for distributing the abstracting journal in 
their respective countries. This plan is before the ISI Executive Committee for 
decision and it is hoped that approval and details can be announced early in 1952. 

In considering operations for the year 1952, we have estimated income at 
$53,400.00 or some $2,000.00 over 1951 performance and we have budgeted our 
expenses at about $50,000.00. This leaves net income of about $3,500.00 and 
provides for an additional and fairly substantial increment to surplus. 

The prospects are for a new period of growth and expansion. This trend is 
emphasized both by the expanding role of statistics in the life and work of the 
nation and by the rise in employment, particularly in the national defense area. 
It is our hope that the Association will be able to make its contribution to the 
problems of the present period and that we will be able to plan in the coming 
years a steady, but financially sound, expansion of our activities. 

SAMUEL WEISS 
Secretary-Treasurer 


To the Board of Directors of 
American Statistical Association 
I have examined the attached financial statements of American Statistical 


Association relating to the year ended December 31, 1951. My examination was 
made in accordance with generally accepted auditing standards and, accordingly, 
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included such tests of the accounting records and such other auditing procedures 
as were considered necessary in the circumstances. 

The recorded cash receipts for the year were traced to the deposits shown on 
the bank statements and the amounts for dues and subscriptions were tested with 
the membership and subscription records. The paid checks were inspected and 
related vouchers tested in support of cash disbursements for the year. The bank 
balances were reconciled with amounts reported directly to me by the deposi- 
taries and the cash on hand and securities owned at December 31, 1951, were 
verified by inspection. I did not check the membership and subscription records 
in detail or make any independent verification of the inventory of old Journals, 
the office records of which are based, in part, on data assembled in prior years. 

In accordance with a resolution of the Board of Directors, the expense incwrred 
in publishing a directory, distributed to the membership in 1951, is being spread 
over a three-year period although such costs would appear to be applicable 
primarily to the year 1951. The amount deferred at December 31, 1951, aggre- 
gated $2,131.86. 

In my opinion, the accompanying statements present fairly the position of 
American Statistical Association at December 31, 1951, and the results of its 
operations for the year, in conformity with generally accepted accounting princi- 
ples applied on a basis consistent, except as mentioned in the preceding para- 
graph, with that of the preceding year. 

JAMES G. JESTER 


AMERICAN STATISTICAL ASSOCIATION 
BALANCE SHEET 


December 31, 
Assets 1961 1950 
Cash in banks and on hand...................00- $28,618.28 $21,544.82 
on aks dae 46h ede dee ee Oke 1,385.48 1,787.97 
Investment in United States Savings Bonds, 

Series G, due 1962, at cost.................. 3,100.00 3,100.00 
Inventory of old Journals, at approximate cost... . 1,625.85 1,824.83 
Inventory of monograph on Acceptance Sampling, 

i pin akh ee hack wha nde rah bb eens snes 452.54 785 .54 
Inventory of emblems, at cost..................-. 588 .00 
Furniture and fixtures, at cost less depreciation... . 2,026.99 2,358.94 
Deferred charges: 

Deferred Membership directory expense......... 2,131.86 
Cnn 54445060 408 pnnnandecee cngscns 723.48 1,133.15 





$40 652.48 $32,535. 
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Liabilities 
pocountes payable. .....cccccccccccccsvccccevers $ 4,811.46 $ 6,248.52 
Deferred income (collections applicable to subse- 
quent year): 
EE cc ipieawksisninesde pees aeeaseneins $13,618.00 $14,525.50 
GebeetigtheGs. .. osc ccccccccccccccsccvcccsoons 4,379.60 4,796.80 
NN bei tides hadnt eeeadenaesah wane 4,365.67 655.50 
$22,363.27 $19,977.80 
Net worth: 
Life membership reserve. ...........-++eeeeee- $ 3,044.10 $ 2,991.50 
Surplus, per statement.............-02e ee eeee 10 ,433 .65 3,317.43 
$13,477.75 $ 6,308.93 





$40 ,652 .48 


AMERICAN STATISTICAL ASSOCIATION 
STATEMENT OF INCOME AND SuRPLUS ACCOUNTS 


$32 ,535 .25 


Year ending December $1, 





1951 1950 
Income: 
Dues—current year... .....ccccccccccccccces $33,358.00 $32,176.68 
Dues—prior year... ......ccccccececcsioncees 400 .00 260 .25 
Life membership income. ............-++++++- 84.40 75.12 
Babeorigtoes. .... 0c csccccscccccccccceneses 9,433.15 8,840.71 
PEI a 56 bsc hc dcccdsrsvcccsscdecsceness 2,251.13 1,602.27 
Journal sales, less cost of sales..............-- 1,907 .03 1,367 .35 
American Statistician sales. .............+++-- 158.65 129.15 
Acceptance sampling monograph, less cost of 
Mik ndveressncnkenssccaesdbbavenan kees 210.60 664.13 
Annual meeting income. ...........--+++++00- 452.17 
Mailing list income. ...........-scccesccceees 683 .34 727 .64 
pL  PPPETTTTTETTCOT TTT TT TTT 490 .80 304.93 
Fentawest IMOOGRS. . 0 oc ccc cccccccccecccosccees 395.40 324.96 
Reimbursement of overhead expenses—Bureau of 
Mines project... ........ccccccscccccescess 1,500.00 
DERDORTIOMOOES. 205 occ cccccccsccsccscccecoess 702 .63 745.91 
$51,575.13 $47,671.27 
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Expenses: 

Journal—printing, mailing and reprints........ $ 9,997.19 $10,622.43 
MRROEOD GE WEBIR. 05sec cc cccccsccesecces 14,817.96 15,436.81 
Pmmorients GAMIGEOR.. 2... 5 oo nc ccc cc ccccweees 4,815.68 4,941.96 
Membership directory (expense apportioned to 

2 s4G Pad 1G4 6464504000460005 000050089 1,233.50 
Annual meeting expense...............00eeee- 141.29 
PT cackencanvessneetsbhscaebeusnbanretede 2,400 .00 2,350.00 
Office supplies, printing and mimeographing.... 1,659.39 1,756.92 
ee ere errr eT eee rer eer 1,911.95 1,616.25 
EEE OTE CET Te eee ee 2,375.00 
Telephone and telegraph...............-.+5-: 482.11 743.49 
Travel and committee expense—officers........ 1,196.52 777.99 
Depreciation of furniture and equipment....... 510.86 512.10 
6.056 65 d0 ce cecevisdescasdes 970.00 720 .00 
SEE FE OTT Ce Ree 1,947.46 1,795.50 

$44,458.91 $41,273.45 

Excess of income over expense for the year..... $ 7,116.22 $ 6,397.82 
Surplus or (deficit) account 

At beginning of the year................... 3,317.43 (3,080.39) 

SCOOT ECCT LOT $10,433.65 $ 3,317.43 














REPRINTS OF ABSTRACTS IN STATISTICAL 
METHODOLOGY 


Edited by Max A. WoopsBury, Princeton University 


POPULATION INDEX 
1952 


83. U. S. Bureau of the Census. 

United States censuses of population and 
housing 1950. Key to published and tabu- 
lated data for small areas. Washington, 
Govt. Printing Office, 1951. iv, 50 pp. 

This “Key” presents in tabular form an 
index to the statistics that will be published 
or available in the form of tabulated but un- 
published data for various types of areas 
smaller than states. There are included as 
appendixes lists of standard metropolitan 
areas, urbanized areas, census tract areas, 
and block statistics cities. 


497. Britton, D. K., and Hunt, K. E. 

The sources and nature of statistical in- 
formation in special fields of statistics. The 
agricultural statistics of the United King- 
dom. Journal of the Royal Statistisal 
Society 114(1): 59-98. 1951. 


498. Canada. Dominion Bureau of Sta- 
tistics. 

The organization and purposes of the trial 
census. Pp. 17-19 in: Annual report of the 
Dominion Bureau of Statistics for the fiscal 
year ended March 31, 1950. Ottawa, King’s 
Printer, 1950. 54 pp. 

A 1949 trial for the 1951 census, utilizing 
procedures for “mark sense” processing. 


524. American Public Health Association. 
Modern methods in the sampling of hu- 
man populations. American Journal of 
Public Health 41(6): 647-668. June, 1951. 
Overall title for three papers presented at 
the 1950 meeting of the Association: 
Cochran, William G. General principles 
in the selection of a sample, pp. 647-653. 
Cornfield, Jerome. The determination of 
sample size, pp. 654-661. 
Hansen, Morris H., and Hurwitz, William 
N. Some methods of area sampling in a local 
community, pp. 662-668. 


528. Cochran, William G. 

Recent developments in sampling theory 
in the United States. Pp. 40-66 in: Inter- 
national Statistical Conferences. Proceed- 
ings ..., Washington, D. C., September 6- 


18, 1947. Vol. III, Part A. Calcutta, Eka 
Press [no date]. French summary, p. 66. 


542. Stephan, Frederick F. 

History of the uses of modern sampling 
procedures. Pp. 81-110 in: International 
Statistical Conferences. Proceedings... , 
Washington, D. C., September 6-18, 1947. 
Vol. III, Part A. Calcutta, Eka Press [no 
date]. French summary, pp. 110-112. 


562. U. S. Bureau of the Census. 

Census publications. Catalog and subject 
guide: January-June 1951. Washington, 
Govt. Printing Office, 1951. 72 pp. 


563. U. 8S. Bureau of the Census. 

Census publications. List of publications 
issued ... except regular monthly and 
quarterly releases. Washington, 1951. June, 
1951. 4 pp. 

Issued through September 1951. 


PSYCHOLOGICAL ABSTRACTS 
1951 


7780. Bartlett, M. S. (U. Manchester, 
Eng.) A further note on tests of significance 
in factor analysis. Brit. J. Psychol. Statist. 
Sect., 1951, 4, 1-2.—In a previous paper the 
author retained in the proposed significance 
test for principal components analysis an 
unresolved doubt on the appropriate num- 
ber of degrees of freedom to be assigned to 
the reduced x?, after elimination of one or 
more significant components. A “more de- 
tailed examination” of a particular case, 
while confirming the extra complication in- 
troduced by the standardization, has also 
helped to resolve this point. The reduction 
in the total number of degrees of freedom 
due to standardization is only partially felt 
in the smaller roots and disappears en- 
tirely when the components eliminated are 
large enough. That this rather peculiar 
result is essentially associated with stand- 
ardization is demonstrated.—G. C. Carter. 


7787. Fix, Evelyn. ( U. California, Berke- 
ley.) Discriminatory analysis: II. Factor 
analysis and discrimination. Randolph 
Field, Tex.: USAF Sch. of Aviation Medi- 
cine, 1950. (Proj. No. 21-49-004, Rep. No. 


313 
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2) iii, 82 p.—This is a survey of the litera- 
ture on the theory of factor analysis with 
special emphasis on the relations between 
factor analysis and discriminatory analysis. 
Bibliography of over 200 items.—A. 
Chapanis. 


7788. Hodges, Joseph L., Jr. (U. Cali- 
fornia, Berkeley.) Discriminatory analysis: 
I. Survey of discriminatory analysis. 
Randolph Field, Tex.: USAF Sch. of 


Aviation Medicine, 1950. (Proj. No. 21-49- 
004. Rep. No. 1) iii, 115 p.—This is a non- 
technical review of the literature on dis- 
criminatory analysis and related topics. 
Mathematical research projects are sug- 
gested in relation to the medical and psy- 
chological problems of Air Force selection 
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and classification programs. Bibliography of 
over 250 items.—A. Chapanis. 


7789. Hughes, Harry M. ( U. California, 
Berkeley.) Discriminatory analysis: III. Dis- 
crimination of accident prone individuals. 
Randolph Field, Tex.: USAF Sch. of Avia- 
tion Medicine, 1950. (Proj. No. 21-49-004, 
Rep. No. 3) iii, 43 p.—To illustrate the use 
of non-normal discriminatory analysis, 
the technique is applied to the discrimina- 
tion of accident-prone individuals. Data 
and examples are drawn from industrial 
and Air Force literature. The arguments for 
accident proneness in the literature are 
criticized and the need for better mathe- 
matical models of proneness is pointed out. 
—A. Chapanis. 
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BOOK REVIEWS 


Methods of Operations Research. Philip M. Morse and George E. Kimball. 
New York: John Wiley and Sons, Inc., 1951. Pp. 158. $4.00. 


J. L. Hopes, University of Chicago 


s MUST by now be well known to everyone, operations research (or oper- 
tions analysis or operations evaluation) is a technique developed during 
the late war for bringing scientific methods to bear on military operations. 
The first sentence of this book gives the following definition: “Operations 
research is a scientific method of providing executive departments with a 
quantitative basis for decisions regarding the operations under their con- 
trol.” First in England, and then in this country and elsewhere, teams of 
scientists and technical men were organized to work with various service 
commands. Their duty was to make objective evaluations of military opera- 
tions, and in many instances their studies resulted in important or even 
spectacular improvements in operational efficiency. 

The present book is essentially a report, prepared at the end of the war, 
and principally concerned with the work of the Operations Research Group, 
U. S. Navy, although some illustrations from other sources are discussed. 
The first and last of the eight chapters have been rewritten with an eye to 
possible nonmilitary applications. After an introductory first chapter, there 
follow chapters on probability, measures of effectiveness, strategical kine- 
matics, tactical analysis, gunnery and bombardment problems, operational 
experiments, and a concluding chapter on organizational problems. There 
are short tables of random digits, angles, and normal deviates, and of the 
binomial, normal, and Poisson distributions. 

The bulk of the work is devoted to the discussion of examples, and these 
are likely to be of wide and general interest. The readers of this JouRNAL, 
however, will be particularly interested in what the book has to say concern- 
ing operations research as a new field for the application of statistical meth- 
ods. On this point, the authors state (p. 7), “The most important single 
mathematical tool of operations research is probability and statistical 
theory.” That they are thinking of statistics in an advanced sense is made 
clear in a discussion of the selection of operations research workers (p. 141): 
“Mathematicians are particularly useful, especially if they have training in 
the field of probability (which does not mean the usual course in statistics).” 

The basic concepts of probability are presented in the second chapter. The 
presentation, which occupies only about twenty pages, is interesting and 
will be of particular appeal to the reader with a background in physics. Such 
notions as distribution and expected value, and such distributions as the 
binomial, Poisson, and normal, are explained and illustrated. Throughout 
the book, a primary technique for analyzing military operations is to formu- 
late a completely determined probability model, (i.e., a model without 
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parameters having to be estimated from data), and to draw conclusions 
about the operations from deductions within the model. The general spirit 
which motivates much of this work is akin to that in statistical mechanics 
and other applications of probability theory to physics. The extent to which 
such methods prove to be of value in military applications is perhaps sur- 
prising. 

When we turn from probability to statistics, the situation is very much 
less satisfactory. The book is almost devoid of any formal statistical theory, 
comparable to the material on probability. The only techniques presented 
as such are the chi-square test and a method of interval estimation attributed 
to Karl Pearson. Most of the statistical reasoning is permeated by a not- 
clearly-stated inverse approach. For example, on p. 99 an operation is dis- 
cussed in which it is desired to know which of two tactics has the greater 
probability of success. The two probabilities of success start out as para- 
meters of the problem, but at the end the authors are computing the proba- 
bility that one of them exceeds the other. 

In general, the material on statistics corresponds to that which was current 
forty or fifty years ago. The Pearsonian flavor is emphasized by the repetition 
(in the example discussed on pp. 36-37) of his original mistake concerning 
the degrees of freedom in chi-square. Again, for a short table of random 
numbers the authors have employed Tippett’s table instead of one of the 
later and better compendiums. The reader will search in vain for an exposi- 
tion of the notion that an estimate should be accompanied by a standard 
error, or for the elements of hypothesis testing. He will read (p. 24) that 
“The standard deviation is, of course, the expected value of the root-mean- 
square deviation.” 

More serious, he will find examples which may tend to inculcate a dis- 
regard for the innate variability of data and for the necessity for taking this 
into account in drawing conclusions. For example, on p. 83, in an analysis 
of high dives by suicide planes on naval vessels, the dives are classified ac- 
cording to their quarter of approach. The numbers of attacks, and the num- 
bers of successful attacks, in the five categories are 1, 6, 10, 13, 5, and 1, 3, 
2, 5, 4, respectively. The authors conclude that it is “apparent” that the 
category in which there were 2 successes in 10 attacks corresponds to a rela- 
tively unfavorable tactic. Again, in another example (p. 82) it is given that 
ship anti-aircraft batteries scored hits in 85 out of 144 cases while maneuver- 
ing, but in 82 out of 124 cases while not maneuvering. The authors state 
that the difference is “probably significant, considering the number of cases 
reporting.” No attempt is made to quantify the significance, although these 
would be excellent examples with which to explain the idea of the significance 
of a contingency table. 

It can be argued that refinements of statistical technique would have been 
out of place in the work of most operations research units during the war. It 
is stated, p. 38, that “successful application of operations research usually 
results in improvements by factors of 3 or 10 or more.” In the most famous 
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examples, the analyses yielding these improvements employed only very 
elementary statistical techniques. Perhaps a reason why operations research 
has not caught on in peacetime activities to the extent hoped for by many of 
its advocates is simply that there is not, in established concerns operating 
in stable fields, this kind of cream to be skimmed. An operations research 
worker who is attempting to bring about an improvement of a few per cent 
in the operations of a concern which is already quite efficient, will have to 
be able to use the most efficient possible tools, and these will more often 
than not be of a statistical nature. It seems a pity that this book, which with 
its many attractive features is sure to be the text for nearly all of the growing 
number of training programs in operations research, should give the impres- 
sion that statistical theory has made no advance in the past half century. 


Quality-Control Handbook. J. M. Juran. New York: McGraw-Hill Book Co., 
1951. Pp. viii, 800. $10.00. 


W. Grant Ireson, Stanford University 


HE reader, accustomed to the usual technical handbook, will be happy 

to find that Dr. Juran’s Quality-Control Handbook differs radically from 
the customary form and style. The authors have employed the natural form 
of expository writing, with only enough lists and tabular forms to support 
the text material. They have not resorted to the handbook style of making 
direct, factual statements without further explanation. This complete pre- 
sentation of ideas makes the book more valuable for the beginner in quality 
control. Furthermore, the size of the page and the large, easily read type- 
face employed contribute to the usefulness of the book. The reader will not 
encounter the eye strain and the desire to lay the book aside that is so fre- 
quent in technical handbooks. 

Dr. Juran has organized this handbook into fifteen sections, with con- 
tributions by seven prominent men in quality control and two corporations. 
Dr. Juran has written several sections himself. 

The first six sections are devoted to the managerial aspects of quality 
control. These sections give complete coverage of such subjects as economics, 
organization for, specification, acceptance and assurance of quality. This 
coverage will be of interest to top management personnel and managers and 
directors of quality-control and inspection departments. The authors have 
drawn heavily from their own experiences and reports of quality-control 
efforts from many industrial concerns to bring to the reader a collection of 
proved practices. The reader will find in each section an analysis of the over- 
all problem, a breakdown of the problem into contributing factors, and sug- 
gested treatments or practices for obtaining the desired solution. No attempt 
is made to solve the problem for the reader; however, the reader will find 
that the ideas presented will stimulate his own analysis of his individual prob- 
lems and assist in their solutions. 
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Sections seven and eight are devoted to the statistical procedures that are 
employed in quality-control work and the paper work involved. The dis- 
cussion of statistical methods begins with the most elementary concepts of 
central tendency of populations and proceeds through the usual methods 
for treating sample data and drawing conclusions therefrom. Analysis of 
variance and linear correlation are treated briefly. Simple illustrations of all 
techniques are included. Acceptance sampling techniques are discussed 
fairly completely and sample tables from Dodge and Romig’s Sampling 
Inspection Tables, the Statistical Research Group, Columbia University’s 
Sampling Inspection tables, and Military Standard 105A are included. These 
tables are provided as illustrations of acceptance sampling schemes and the 
methods of constructing sampling inspection tables. 

The remaining sections of the handbook are devoted to applications of 
quality-control principles (including acceptance sampling and quality as- 
surance) in several types of industry, including textile, chemical, aircraft, 
electron tubes, and screw-machine operations. These illustrations will be 
very valuable to practicing personnel who are responsible for the develop- 
ment and operation of quality control procedures in a given company. The 
applications represent the best current practices and point out many of the 
difficulties encountered in developing satisfactory procedures. There are 
many unique examples of the use of statistical methods for solving trouble- 
some quality problems. These should be helpful to the reader by way of 
pointing out techniques that might not seem applicable to the situation at 
first thought. 

There are two appendices: a glossary of symbols and tables and charts for 
many statistical methods. The tables and charts include Normal Distribu- 
tion Areas, Normal Curve Ordinates, Distribution of t, Distribution of x?, 
Distribution of F, Factors for all Shewhart Control Charts, and Table of 
Squares. ; 

Dr. Juran has attempted to compile in one volume the mass of knowledge 
of quality-control procedures and methods and the best techniques available. 
This results in a book that combines the managerial aspects and the technical 
aspects of quality control. Unfortunately, since this book is in a field that is 
rapidly developing new methods and concepts as a result of the large number 
of persons employed directly in research, it cannot be completeiy current. 
On the other hand, its value lies in its summary of all previous work in qual- 
ity control. It will be especially valuable as an introductory text; it will give 
the reader a broad comprehension of the many interrelated problems of 
quality control, but it cannot be used to provide adequate instruction in the 
statistical methods. The statistical techniques are neglected to a considerable 
extent. 

The book will have considerable value to the statisticians who need to 
_ understand the managerial problems of quality control. Parts of it will be 
helpful to experienced quality-control personnel in providing a good collec- 
tion of ideas representing many points of view on each subject. These will 
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tend to force the individual to analyze his problems from different view- 
points and reduce the number of cut-and-dried attacks on quality-control 
problems. 


The Extrapolation, Interpolation and Smoothing of Stationary Time Series with 
Engineering Applications. Norbert Wiener. New York: John Wiley and Sons, 
Inc., 1949. Pp. ix, 163. $4.00. 


Joun W. Tukey, Princeton University 


t was, I believe, in 1942 that copies of Wiener’s report on this subject, 
I stamped CONFIDENTIAL, began to circulate among those concerned with 
the military applications, mainly to fire control, of prediction and smooth- 
ing. Like other books and reports with a yellow cover, it became known 
as “the yellow peril.” It was followed by a host (at least a dozen to my 
knowledge) of similarly classified “simplifications” or “explanations” of 
the procedure. The volume under review consists of 123 pages devoted to a 
practically verbatim version of Wiener’s original report, a five-page table of 
Laguerre functions, and 32 pages devoted to two of the simplifications, 
written by Norman Levinson, which had already appeared openly in the 
Journal of Mathematics and Physics. 

There is much in this book which the statistician interested in time series, 
noise, or stochastic processes (these are all words for the same thing) would 
like to know, but he may not find it easy to dig them out (the writer did not!) 
If this review assists the digging at all, it will have served one of its main 
purposes. 

We had best begin by describing a fairly general problem, and the struc- 
ture of its treatment, as a background for a description of Wiener’s work. Let 
us suppose that we observe 


a(t) = y(t) + 2(t) (1) 


for various values of t, where x(t), y(t), z(t) are stationary time series, in- 
dividually and collectively. Stationarity means, heuristically, that 


Xo(t + 8) = yott + 8) + Zo(t + 8), 


for any fixed functions yo(t) and z¢(t) is as likely to represent (1) for any one 
s as for any other. Let us suppose further that our task is to estimate 


y (to) 


and that the criterion of badness of estimation is the mean square error (in 
mathematician’s language, we work in L?). This is the situation familiar to 
the statistician of trying to free measurements, z, from errors, fluctuations, 
or noise, z. Even the criterion is familiar. 

These two suppositions of stationarity and mean-square criterion are the 
twin cruxes of the matter. When they hold the book applies, when they fail 
seriously, other methods may be better. 





320 AMERICAN STATISTICAL ASSOCIATION JOURNAL, JUNE 1952 


The statistician is familiar with using the structure of y and 2 in such 
problems. If we can assume that y(t) is a straight line (or a polynomial of 
fixed degree) and that the 2(#) are independent of y(t) and of each other, 
then we all know how to handle the analogous problem by fitting a straight 
line and substituting in the result. This will be smoothing if we substitute 
at an observed point, interpolation between, and extrapolation outside ob- 
served points. These familiar methods are not useful here without change, 
since stationary time series are not representable by polynomials. 

The correct simple representation is in terms of the distribution of variance 

‘over (angular) frequency (w in cos (wt+y¥) is the angular frequency). Such 
a decomposition in to variance components exists for any stationary time 
series. The ubiquitous existence of such a (power or variance) spectrum is a 
consequence of the fact that the only solutions of 


f(t + 8) = af) 


are exponentials, and the only bounded ones are of the form 


S(t) = ce”! = ¢ cos wit + ic sin at. 


The choice of a method for calculating y(to) falls into three steps: 

I. By subject insight, by analysis of data, or by crystal ball, determine 
the spectrum of y, the spectrum of z, and their cross-spectrum (more later 
about these). 

II. If z(t) is known for all time, we may deal with each frequency sepa- 
rately. Since we know the variance components at this frequency we can 
specify the best regression coefficient (which is in general complex) for the 
corresponding frequency component of the estimate on the corresponding 
frequency component of z(t). Then we can reassemble the parts and find a 
linear expression for the optimal estimate of y(to) in terms of x(t) for all time. 

III. Knowing what linear combination is best for all time, we can take 
account of our limited knowledge of x(t), which rarely extends infinitely far 
into the future, and modify this to find the best linear expression which uses 
only available data. 

The first of these three steps is not discussed in Wiener’s book. Relatively 
useful techniques, indeed, were not developed until after the war. This is a 
straightforward statistical problem, and one in which basic experience is 
currently being gathered. (When Wiener says, as on page 44, that the co- 
herency matrix is Hermitean, the reader should be careful to conclude that 
while the individual spectra are real, the cross spectra are complex—two 
real spectra, conveniently called the cospectrum and the quadrature spec- 
trum, are needed to describe the mutual relations of a single pair of stationary 
time series.) 

The third of these three steps arises without the second when we are 
predicting a series measured without error (that is, t9>all ¢ for which 2 
known, and 2(¢)=0). The third chapter of Wiener’s book is devoted to this 
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case, and a complete solution is found for certain simple spectra. This solu- 
tion involves the factorization of an analytic function according to the 
relation of its poles to the real axis and is far from perspicuous to those 
not deeply steeped in complex function theory. (A more heuristic approach 
is given by H. W. Bode and C. E. Shannon in the Proceedings of the Institute 
of Radio Engineers, 38 (1950) 417-425, although this version will require 
careful consideration before it seems clear.) 

In the fourth and fifth chapters, Wiener discusses first measurements 
with error, and then measurements on two or more series. In both of these 
cases steps II and III are present in combination. 

It is unfortunate, from the point of view of the statistician, that Wiener 
prefers, for the sake of favorite mathematical tools, to describe the ensemble 
(population) of time functions with which he is concerned in the form 


{x.(t) = f(t + s)} 


instead of dealing with a probability (measure) defined for sets made up 
of the sort of functions to be considered. The latter is easier for the statis- 
tician to think about. Both are equally valid mathematically, requiring 
different mathematical tools for the delicate limiting processes with which 
inferences based on observed data will not usually be concerned. 

There is much mention in the book of both autocorrelation functions and 
spectra. The theoretical development would limp without the use, at least 
implicit, of both. Mathematically, it is easiest to define the autocorrelation 
function (which statisticians would rather see called the autocovariance 
function) first, and then the spectrum. Thus, there appears to be a dis- 
symmetry to the theory, which speaking mathematically is not there. (As 
a matter of computation engineering, when handling data, it seems best to 
find the autocovariances first when proceeding digitally, and the spectrum 
first when proceeding analogwise. As a matter of interpretation, the reviewer 
can learn far more from the spectrum.) 

This is a book which can reward long and careful study, and which 
requires it. The delay in reviewing it here is not the responsiiility of the 
Editors, but solely of the reviewer, who waited until he thought he under- 
stood it well enough to try to help others. 


Statistical Methods for Chemists. W. J. Youden (National Bureau of Standards, 
Washington, D. C.). New York: John Wiley and Sons, 1951. Pp. xi, 126. $3.00. 


K. A. BRowNLEE, University of Chicago 


yor in the Preface to his Statistical Methods for Chemists uncom- 
promisingly nails to the mast his flag of non-conformity with the 
doctrine that those who use statistical methods must have a complete 
grasp of their mathematical background—‘“The book is characterized by 
an absence of statistical theory and proofs.” In fact he goes further and 
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carries the war into the enemy’s camp with the statement that “young 
men who have been given a fine mathematical grounding in statistics often 
have an inadequate grasp of the ways in which measurements behave.” 
This controversy had perhaps be best left to the operations of a free market. 
Those who want the mathematics will shop elsewhere; those who want ex- 
perimental experience and a heuristic account of the working of statistical 
methods will be happy to purchase Youden. 

Youden specifically directs his volume to working chemists—‘“The intro- 
ductory material has been held to a minimum, because I believe that labora- 
tory men know a great deal about data that is not known to students taking 
a school course in statistics.” The volume is further limited in size to 115 
pages of actual text, plus tables of t, F for the five and one per cent levels, 
and squares of numbers from 1 to 999. This modest scope makes it somewhat 
unfair to make criticisms of omissions, except when they are positively 
misleading. For the record, however, it must be stated that the chi-squared 
test for discrete data, the control chart, tests for randomness, multiple 
regression, covariance, and transformations of the variate, are among the 
topics not treated. We may agree with Youden that “the omissions reflect 
my conviction that many of the ordinary situations confronting the scientist 
require knowledge of only a few statistical techniques” (page v). It is a 
pity, however, that the comparison of means with unequal variances is 
not mentioned, as it is the reviewer’s opinion that this situation does arise 
frequently in chemical work and that faulty conclusions are often drawn 
through failure to appreciate it, particularly when the numbers of observa- 
tions involved are small. 

The book starts with discussions of precision and accuracy, including the 
standard deviation, the normal law of error, degrees of freedom, confidence 
limits, the ¢ and F tests. These sections read well, though some statisticians 
may have their own reasons to smile uneasily at the statement (page 4) 
“Statistical formulas are internationally accepted and, like a table of 
international atomic weights, are useful in avoiding unnecessary disputes.” 
The section on the rounding of data (page 7) is much too short to do justice 
to this important (and frequently neglected) practical problem. On page 15 
the generally used computational formula for the sum of squares of devi- 
ations from the mean, ) 2?— (>> 2)?/n, is used in a numerical example before 
the explanation on page 16 that it is algebraically identical to )_ («—7)*. 
Similariy, at the bottom of page 21, after a statement that a statistical 
test, devised by Bartlett, will test the homogeneity of variances, it is a little 
startling to read next “The acceptable upper limits for B may be read 
from a statistical table of chi-square,” there having been no previous men- 
tion of B. The point is clarified on the next page. On page 25 it might help 
the reader to explain the origin of the n and m in the formula for the ¢ 
test for the difference between two means, rather than bluntly present the 


factor ./|[nm/(n+m)]. 
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After an account of the analysis of variance between and within columns 
of unequal size there follows a chapter “The Resolution of Errors” devoted 
mainly to distinguishing between sampling and analytical errors. The 
example on page 35 of coal analyses by laboratories drawn in four pairs 
at random from a particular group of eight, the resulting pairs happening to 
be A and D, F and D, H and E, B and E, whereupon a component of vari- 
ance for Between Laboratories is isolated and then used for predictions 
about the population of all possible laboratories, cannot be considered the 
kind of practice one likes to see recommended. The reliability of such an 
estimate of a component of variance, though difficult to compute exactly, 
must be exceedingly low and quite unsuited for general predictions. 

The subsequent example on the sampling of wool from bales would have 
gained substantially from an introduction of the relative costs of sampling 
and analysis, particularly as the final conclusion implies that more bales 
should be sampled with fewer analyses per bale, a conclusion that might 
easily be reversed if costs were taken into account since it was stated that 
the expense of sampling was relatively high. 

The chapter on the “Statistics of the Straight Line,” including the line 
constrained to pass through the origin, is a model of clarity and relevance, 
though Youden avoids potential trouble by confining himself to the situation 
where the error in z is small compared with that in y and the only use made 
of the lines is consideration of the regression coefficient and the intercept. 

There follow two chapters on the analysis of variance, in which an example 
of “interaction” is given (page 59) which should immediately satisfy all 
chemists. The second of these chapters ends with a very sketchy account of a 
three factor analysis of variance which suffers from being purely hypotheti- 
cal, containing no example, and which gives no indication of the troubles 
which can arise in the interpretation of such tables of analysis of variance. 

On pages 80-81 the advantages of the analysis of variance for making 
comparisons amongst several (say n) means is represented solely as the 
avoidance of having to make $C separate comparisons. While this is one 
good reason, another more important is that it avoids the disturbances of 
the significance level that arise when multiple simultaneous non-independent ¢ 
tests are made. 

On pages 88-91, as an example of randomised blocks some radioactive 
counts are used. While there is nothing wrong with this example the subse- 
quent discussion darts backwards and forwards from counting rate to 
absolute count. This would be easier to follow if in consistent units. It is 
further confusing to read “Assuming that the radioactive disintegrations 
follow a Poisson Law, the theoretical precision of such counts is known to 
be the square root of the number of counts,” since the precision was defined 
on page 10 as the variance. 

The remaining three chapters deal with experimental design. In the first, 
randomisation is given satisfactory emphasis. The second discusses ran- 
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domised blocks, Latin squares, balanced incomplete blocks, and Youden 
squares. An omission in the latter is that though a satisfactory numerical 
example dealing with thermometer standardisation is taken through to its 
analysis of variance the reader is left at the point where the estimated 
standard deviation of a single reading is found. The much more useful 
statistic, the standard error of the difference between two corrected means, 
is not given. 

The treatment of Latin squares is built around a satisfactory example 
(page 92) of an experiment on the breaking strength at three different 
aging times of twenty-seven test specimens of cement prepared in nine 
molds, using three mixes. The molds were segregated into three groups, 
each forming a Latin square, with mixes, molds, and aging times as the three 
factors. The analysis of variance (Table 47) has the residual labelled “Resi- 
due: mixes X molds,” which seems somewhat misleading as it also contains 
components of the times Xmolds and times X mixes interaction. The state- 
ment on page 98, “Furthermore the row and column factors must not inter- 
act with each other” also misleads by failing to add that the treatment X rows 
and treatments Xcolumns interactions also must not exist. The reviewer 
finds the subsequent development of this example obscure. It appears that 
it is concluded that molds are without significant effect and the time X mix 
interaction is then calculated. This seems to violate the rule that the form 
of the analysis of variance should be laid out in advance and not altered to 
suit the results as they arise. The method of calculating the interaction in 
its two components in Table 48 is not given: “The isolation of the above 
sums of squares for the time Xmix interaction is left as an exercise for the 
reader. Success in this exercise may be taken as denoting a good compre- 
hension of the analysis of variance technique.” The reviewer feels that this 
is quite unreasonable. A high degree of sophistication is necessary to recog- 
nise that the I component of the interaction is associated with the mold 
sum of squares and the J component with the residual. The effect of this 
section, if taken seriously, would be to cause despair in all chemists and 
many statisticians. 

The last chapter attempts to deal in eight pages with the analysis of 
variance of a four factor experiment, confounding of the 2‘ experiment in 
blocks of 8 plots, partial confounding of the 2° experiment in blocks of 4 
plots, confounding of the 2° experiment in blocks of 8 plots, and the half 
replicate of the 2° experiment. This whole chapter suffers grievously from a 
lack of experimental material, there being no actual examples. The reason 
can be implied from page 111: “Among chemists the reception so far ac- 
corded this experimental device (confounding) is reminiscent of the young 
man seeking employment. No one will hire him until he has experience, 
and he cannot get experience until someone gives him a job.” While one can 
sympathise with the young man, and with Youden, there are probable 
grave dangers in placing these methods in the hands of inexperienced 
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chemists without any illustration of the way they often work out in practice. 
The chemist, if he was sharp-witted enough to recognise the surprising diffi- 
culties that often arise with actual data might very well be discouraged as 
Youden gives him no help here. 

The text ends with a sketch of the application of fractional replication to 
a 2° system starting out as a quarter replicate and proceeding, by dropping 
out factors and running more plots, to a full replicate of a 2°. No mention 
is made of the fact that in the initial quarter replicate all six factors have 
first order interactions as aliases and this weakness could lead to seriously 
faulty conclusions. 

The volume ends with a short bibliography of six items, the tables, and 
an index. The tables of F, incidentally, do not go beyond m= 12, which seems 
an unnecessary limitation. 

An overall assessment of the book’s achievement would conclude that 
within its limited ambition, namely to be a primer of statistics for chemists, 
Youden has done a satisfactory job. The book is of such a length, and cast 
at such a level, that it should not be difficult to persuade chemists to read it. 
That they will profit by it very considerably is indubitable. The few infelici- 
ties of exposition will not trouble conscientious readers, and so long as 
they realise that they are dealing with a primer and not with a compendium 
they will be well served. 


Theory of Mental Tests. Harold Gulliksen. New York: John Wiley & Sons, Inc., 
1950. Pp. xix, 486. $6.00. 


KENNETH E. Cuiark, University of Minnesota 


= book deals with the mathematical theory and statistical methods 
used in interpreting test results, and with the quantitative problems 
involved in the development of aptitude or achievement tests. Although 
written primarily for those persons interested in test construction, this 
book contributes to the solution of problems faced by workers in such other 
areas as appraisal of personality, measurement of interests, attitudes, or 
opinions, and clinical diagnosis. 

Gulliksen has divided his presentation into two parts, each of about the 
same length. The first part is primarily theoretical, and involves the develop- 
ment of an ideal model to provide measures of accuracy of test scores and 
to assay the effects on such measures of changes in test length and in group 
heterogeneity. His basic equations are first derived from a definition of 
random error, and then from a definition of true score. Various measures 
of error are then developed and interpreted. The effect on other test pa- 
rameters of doubling test length, and the effect of test length on mean and 
variance, on reliability, and on validity are explored in successive chapters. 
The effects of group heterogeneity and of explicit or incidental selection 
on test reliability and validity are then presented. 
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The second part of the text deals with various practical problems of 
test construction, and includes development of a statistical criterion for 
parallel tests, methods for estimating test reliability, the problems and 
peculiarities of speed versus power tests, the attributes of various kinds of 
test scores, and methods of standardizing and equating scores. The problems 
encountered when several test seores are available for use in combinatior, 
and the effects of using different methods of weighting these scores are 
discussed, followed, in the last chapter, by a discussion of item analysis 
procedures, and the development of a rationale fir such analysis. 

A bibliography of over five hundred titles is included. Appendices supply 
equations from mathematics and statistics used in test theory, a table of 
ordinates and areas of the normal curve, sample questions for examinations 
in background statistics and test theory, and answers to the problems 
presented at the end of each chapter in the text. 

The nature of the subject matter requires the author to devote a con- 
siderable part of his presentation to the development of basic formulas. 
Most of the text, however, can be followed and understood by any person 
who knows the fundamentals of elementary algebra and statistics and who 
has some acquaintance with analytic geometry. In those chapters where 
additional background is required for following the derivations, Gulliksen 
has stated his assumptions and his conclusions in such a way as to produce 
understanding of the general principles involved. It is apparent that the 
author has attempted, with marked success, to present his material so as 
to promote the use of the principles stated, and to provide an understanding 
of the reasoning and the mathematical bases of test theory. His lucid chapter 
summaries, and his frequent provision of schema for preparing computing 
diagrams give evidence of considerable work expended to attain this objec- 
tive. 

While most of the book is devoted to a review of existing literature, 
Gulliksen has incorporated materials of his own which are available for the 
first time. His chapter on speed versus power tests includes much that is 
new, and that should prove especially useful in determining time limits 
to be set for tests that are partially speed, partially power tests. The analysis 
of the effects of differential weighting of scores used in combination is a 
real contribution as a summary and integration of tle work of many con- 
tributors. The chapter on item analysis is a distinct contribution, both in 
terms of its description of the effects of various item seiection methods, and 
its provision of a penetrating summary of the unsolved problems in this 
field. 

This book is a valuable addition to the field of test theory, and to the 
library of any person who is concerned with problems of test development 
or prediction. The author has done a very good job of giving adequate 
coverage to both the theoretical and the practical aspects of test construc- 
tion and analysis. Few errors in formulas and citations were found. While 
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writing primarily from the standpoint of one concerned with aptitude and 
achievement testing, Gulliksen has stated his principles in general terms 
which may be applied to the problems of workers in almost any field of 
measurement. 


Statistical Calculation for Beginners, Second Edition. LZ. G. Chambers (Assistant 
Director of Research in Industrial Psychology, Cambridge). Cambridge: Uni- 
versity Press, 1952. Pp. xii, 168. $2.50. 


Lye V. Jones, University of Chicago 


= is an elementary exposition of methods of calculation, addressed 
chiefly to students in psychology and the biological sciences. The book 
demands of its reader no mathematical ability other than arithmetic. As 
in the earlier first edition, most of the common descriptive and inferential 
statistics are discussed. The value of this second edition is enhanced by the 
inclusion of several new topics, notably analysis of variance, use of the 
binomial distribution, Kendall’s tau, selected methods for curve fitting, and 
a chapter on methods for the statistical treatment of judges’ rankings and 
paired comparison data (coefficients of consistence, agreement, and con- 
cordance). 

As a simple guide to statistical computation, this book is excellent. 
Presentations of computational schemes are succinct and well written. 
While the reviewer would have liked to find the inclusion of a greater number 
of non-parametric tests, the book is satisfactorily comprehensive. In most 
respects, it is exceedingly comprehensible. Introduction of each major topic 
is accompanied by appropriate examples (45 examples are included in the 
text), and exercises are provided with answers. Of particular interest is 
the Appendix, containing a number of labor-saving tables of aid in com- 
puting Spearman’s rho, Kendall’s tau, the coefficient of agreement u, and 
the coefficient of concordance W, among others. Most of these are published 
here for the first time. 

The second edition is superior to the first with respect to the appearance 
of assumptions essential for interpreting statistical results; e.g., assumptions 
underlying the use of Students ¢ have been incorporated. However, there 
still is insufficient reference to methodology of sampling and of experimental 
inference. In his introductory chapter, Mr. Chambers stresses the need for 
understanding limitations imposed on the use of statistics by fundamental 
theory and assumptions. That stress is not reinforced by ideas presented 
(or sometimes not presented) in later chapters. 

*}While tests of significance are exemplified in early chapters in connection 
with mean differences, differences between proportions, correlation coeffi- 
cients, etc., there is no mention of a test of the null hypothesis until relatively 
late in the book. In the chapter on x? the notion is introduced: “This latter 
hypothesis (that of no association between two variables), known as the 
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null hypothesis, assumes that two variables are not associated: if we can 
disprove a null hypothesis, it follows that the two variables must be associ- 
ated.” The impression a reader might get from the brief discussion in this 
section is erroneous in several respects. Certainly the null hypothesis is not 
always an hypothesis about degree of association between variables. A more 
telling criticism should be levelled against the implication that a null 
hypothesis can be so positively disproven that the alternative hypothesis 
must obtain. There appears little to counteract this implication in subse- 
quent examples, where, if a statistic is significant at the .05 level, we are led 
to conclude, flatly, “the null hypothesis is disproved.” 

The author is to be commended for the very understandable presentation 
(page 34) of the distinction between one-tail and two-tuiled tests of hypothe- 
ses. One searches in vain, however, for references to this distinction later in 
the book. As a result, ambiguities appear. Particularly puzzling are the in- 
structions offered for testing significance of rank correlation coefficients. 
A two-tailed ¢ test is utilized for the test of Spearman’s rho whereas the 
significance of Kendall’s tau is evaluated by considering only one tail of the 
normal curve. Neither in these illustrations nor in other ambiguous ones is 
the rationale made explicit. Incidentally, a correction for continuity is 
incorporated in the test for significance of tau as if the coefficient were always 
a positive number. It can, of course, take on negative values. 

A few other criticisms might be noted briefly. (1) The median is defined 
as the middle observation and the mode as the most fashionable observation. 
In examples, however, these statistics are determined correctly, not as 
observations, but as points along continua. (2) The mean deviation is 
defined as the sum of differences between observations and the median, a 
unique definition, to say the least. (3) The coefficient of variation is presented 
as a statistic useful for comparing the scatter of different variables about 
their means. In a book designed primarily for psychologists, it would seem 
important to indicate that such comparisons are legitimate only when meas- 
urement is based upon scales with absolute or true zero points. Psychological 
test scores typically exhibit an arbitrary origin, and the coefficient of vari- 
ation is not applicable for use with such data; for this reason Exercise 10, 
page 23, provides a misuse of the statistic. (4) The treatment of product- 
moment correlation would be of greater value if it were to include discus- 
sions of possible interpretations of r, of assumptions which underly the 
interpretations, and of factors which may affect the size of r. (5) There 
appears no reference to the assumption of additivity which underlies the 
interpretation of F in analysis of variance, nor is there introduced even a 
hint of the possibility of replication in factorial design. In psychological 
research, where individuals often are treated as experimental variables, 
significant interactions typically are present. The most rigorous tests for 
interaction, particularly in analyses involving two-way classification, 
demand an estimate of error variance based upon replication. 
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Index Numbers. Bruce Mudgett. New York: John Wiley and Sons, Inc., 1951. 
Pp. x, 135. $3.00. 


J. M. Goutp, Market Statistics, Inc. 


gage the general title, Professor Mudgett has centered his attention 
on the characteristics and weaknesses of the fixed-base index number 
formula in the measurement of price movements. The chief argument 
brought to bear against the Laspeyres base-year aggregative fromula, 
p » 7190/>_, Poo, and the companion Paasche given-year formula, > ,p:9:/> pom, 
is that it is illogical to use any formula whose weights do not reflect the two 
periods involved in binary comparisons. This leads him to the choice of a 
chain index number in which, for each binary comparison, both the given- 
and base-period weights are “crossed,” either arithmetically, as in the 
Marshall-Edgeworth formula, > Pi(qo+q) />po(qo+a); or geometrically, as 
in the formula Yn Vaon/>. po V dog. 

In addition to avoiding the logical error involved in the use of constant 
weights, the chaining device would permit the inclusion in the computation 
of additional commodities not present in either terminal point of the series, 
thereby broadening the coverage of the index as much as possible. While 
conceding the cumulative error to which chain indexes are vulnerable, 
Mudgett feels that a corresponding degree of error affects results yielded 
by the fixed-weight formula. The only virtue possessed by the latter, aside 
from ease of computation, is a conceptual simplicity which, however, the 
author feels is spurious. 

While Mudgett is unusually vehement in his evaluations, all this will 
come as no surprise to students of index numbers who must see in any 
formula a choice or compromise between evils. For instance, the author 
does not contend that his crossed-weight chain formula offers any real solu- 
tion to the problem of measuring price movements over long periods in which 
the entire price and quantity structure undergoes great change. Yet it is 
precisely in such cases that the cumulative error of a chained series would 
appear to be most dangerous. The suggestion offered by Wesley Mitchell 
in The Making and Using of Index Number: still appears to this reviewer a 
sensible procedure for beth short and long term comparisons, i.e., to calculate 
the results with both a chain index and a fixed base index. Attention can 
then be paid to the difference yielded by the two approaches, and when the 
difference is large an average can be struck in the absence of any reason to 
favor one resu!t over the other. This procedure, incidentally, underlay the 
indexes of output constructed at the National Bureau of Economic Research 
in its series of monographs on output and productivity in American industry 
since the turn of the century. 

Mudgett concludes his book with some rewarding chapters reviewing the 
development of the Bureau of Labor Statistics indexes of wholesale prices 
and consumer prices, presented as case histories of the evil consequences 
of reliance on constant weights. The author feels that his strictures against 
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constant weights are particularly borne out by the violent attacks made 
in recent years on the BLS index by labor groups. The current use of the 
consumers’ price index as a basis for wage adjustments requires, he states, 
“constant revision of commodity lists and weights.” While this is perfectly 
true, it must be noted, too, that the labor criticism of the BLS index goes 
far beyond the technical problems of changing commodity coverage and 
choice of index number formula. The question that has yet to be answered 
is whether the use of the BLS index, even in its present “improved” form, 
is really warranted for adjusting wages to changes in the “cost of living.” 
This was in part recognized by the BLS when the “cost of living” title was 
dropped. Changing names, however, has not relieved the BLS of the re- 
sponsibility that has been increasingly forced on it. It is to be regretted that 
Professor Mudgett has not explored some of the broader issues involved in 
the use of the BLS index in current wage negotiation policy. 


Application of Linear Programming to the Theory of the Firm. Robert Dorfman 
(Assistant Professor of Economics, University of California, Berkeley). Berkeley 
and Los Angeles: University of California Press, 1951. Pp. ix, 98. $3.50. 


Rosert M. Sotow, Massachusetts Institute of Technology 


| Fey programming, which was launched only a few years ago in a flurry 
of dittoed memos from such places as the Air Force, the Rand Corpora- 
tion, and the Cowles Commission, is beginning to have a literature of its 
own. Dorfman’s little beauty of a book is a most welcome addition. It 
provides a compact, readable introduction to this new technique, which is 
now within reach of graduate courses in economic theory. The ever more 
commonplace matrix notation is employed, but this can be swallowed or 
circumvented. For the rest, the book is a model of expositional clarity. 
It illustrates the new techniques in application to the clessical economics 
of the firm, and Dorfman eschews mathematical frills to keep the economics 
always to the fore. At the same time, the treatment is sufficiently rigorous. 

Constrained extremum problems are in a way the bread and Lutter of 
economic theory. Linear programming began as the study of the maximiza- 
tion of linear functions, constrained by linear inequalities. A special tech- 
nique is needed because the usual calculus methods fail. A linear function 
rises forever if it rises at all; its extrema will be found on the boundaries 
of the region delimited by the constraints. Thus instead of the standard 
marginal conditions, linear programming and its best known numerical 
procedure, the “simplex method” of Dantzig, resort to a more direct ap- 
proach, amounting to enumeration of possibilities. One goes to the bound- 
ary and moves along it, ever uphill. The mathematics guarantees that the 
process terminates at the peak, in a finite number of steps. Recently Kuhn 
and Tucker have extended the methods to the maximization of non-linear 
functions of a wide and useful class. Dorfman employs some of these recent 
developments. 
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The book begins by recasting the traditional model of production in a way 
made familiar by Leontief’s input-output system, but more general than the 
latter. Instead of operating a production function showing a certain output 
for each possible combination of inputs, the firm has at its disposal a finite 
number of processes. The process is the basic concept: it is a way of trans- 
forming inputs into outputs (joint production is welcome in this scheme). 
The fundamental assumptions about processes are three in number: (1) 
the mutual proportions of inputs and outputs are fixed and independent 
of the scale at which the process is operated; (2) subject to resource limita- 
tions a process may be operated at any level or scale, large or small; (3) if 
two or more processes are operated simultaneously, their inputs and outputs 
are simply additive, all passes as if the processes were hermetically sealed 
from each other. The object of a firm is to choose the levels at which it 
shall operate the processes within its technology so as to maximize money 
profits. Not all available processes need be used. 

Chapter II uses this model to study a firm whose product and factor 
markets are perfectly competitive. Since variable inputs are bought and sold 
at constant prices, we may choose as the unit level of a process the level 
which brings $1 of net revenue. It is at once apparent that there is so far 
no limit to the profits of the firm so long as it has one profitable process. 
This is the old problem of the indeterminateness of the size of the firm in a 
competitive industry with constant returns to scale. This difficulty is usually 
settled, uneasily, by Kaldor’s device of considering “entrepreneurship” as a 
fixed factor. Dorfman matter of factly assumes from the start that the firm 
has a number of resources or fixed factors over which it disposes in strictly 
limited amounts. These provide the bound to size of firm. Under unrestrictive 
assumptions Dorfman proves Dantzig’s theorem that the number of proc- 
esses actually operated (including “disposal processes”) will be equal to the 
number of scarce factors. The optimal production program is established, 
and, what is very characteristic of linear programming, a method of calcu- 
lating the optimal program is provided. Finally the connection between the 
so-called “dual problem” to the maximization and the imputation of prices 
to the fixed factors is explained. Fixed factors not fully used turn out to be 
free goods. 

The third chapter analyzes from the same point of view a monopolistic 
firm facing falling demand curves. Here total revenue is quadratic and modi- 
fied marginal methods apply. Dorfman very ingeniously gets around the 
fact that only one-sided marginal conditions apply to processes not actually 
used. Even without fixed factors the optimum problem has a solution when 
demand curves fall, but in a final section on “quadratic programming” 
Dorfman is able to introduce fixed factors into the monopolistic firm and 
solve the valuation problem. Again stressing the significance of programming 
as an administrative and empirical tool, computational methods are given 
and in fact many results follow constructively from the technique for locating 
an optimum. 
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In the last chapter Dorfman indicates some of the broader uses to which 
his methods may be put. They extend beyond the bounds of the single firm 
to administrative and planning problems involving whole sectors of the 
economy. An important direction for future research is the development 
of probability and statistical methods appropriate to this particular model. 
We need techniques for estimating unknown parameters, and ways of intro- 
ducing random components directly into the production and price relations. 
This chapter also contains some reflections on the realism and relevance 
of the restricted model of production used. My impression is that Dorfman 
concedes too much to the critics of the profit-maximization principle—he 
neglects, for instance, the “survival-value” of high profits. On the other hand 
he seems to concede too little to the criticisms of additivity, constant 
returns, constant costs. This is not the place to enter into an old argument, 
but Dorfman appears quite cavalierly to rule out any possibility of interac- 
tions among processes. Interactions may be essentially bound to scale of 
operations and so not conceivable as a new, different process. Similar re- 
marks apply to the old debate over constant returns. 

This is a short but sweet book. One may hope it will have wide circulation. 
The price of $3.50 for 98 varityped pages seems to indicate the publisher’s 
belief to the contrary. Surely this is too high a price for a book which might 
be sold to graduate students. 


Income and Wealth, Series I. Edited by Erik Lundberg. Cambridge, England: 
Bowes and Bowes, 1951. Pp. xv, 297. 30 s. 


L. R. Kier, University of Michigan 


HE International Association of Research in Income and Wealth was 

established in 1947, and held its first conference at Cambridge, England, 
in 1949. The volume under review consists of a selection of eight of the 
papers presented, prefaced by Erik Lundberg’s introductory remarks. The 
papers are of two types, one dealing with general problems of national 
accounting and the other with the use of national accounting schemes in 
formulating economic policy in each of five countries. 

Among the first group we find “Functions and Criteria of a System of 
Social Accounting” by Richard Stone, “Government Product and National 
Income” by Simon Kuznets, “Intertemporal Comparisons of Real National 
Income: an International Survey” by J. B. D. Derksen, and “Some Problems 
in the Measurement of Changes in Real Geographical Product” by W. B. 
Reddaway. The titles adequately indicate the subjects covered. 

Stone, with his customary elegance, lays out the general properties that a 
social accounting system should have and gives an illustrative example 
from British data for 1948. He develops operating (production), appropri- 
ation (consumption), and resting (adding to wealth) accounts for businesses, 
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households, public collective providers, labor service,! home lending service,! 
foreign lending service,' and the rest of the world. The reviewer feels that 
an unclear picture of economic processes is given by his treatment of house- 
holds as a group whose products are not intended for sale. It is difficult to 
envisage households not sellers of labor services. Similarly one might object 
to Stone’s inclusion of taxes in government appropriation accounts instead 
of operating accounts. 

Kuznets deals with the problem of defining the government contribution 
to national income. The main problem is to allocate government activities 
between the provision of final and intermediate products. Government 
accounts are now becoming so large that a thorough consideration of this 
problem is urgent. The discussion is competently presented with the end 
in view of using national income estimates as a measure of welfare. National 
income and related measures may, however, be used for a variety of pur- 
poses. What would be an appropriate measure of the government contribu- 
tion if the analysis of inflationary and deflationary tendencies were the objec- 
tive? The paper does not go into such problems. 

Derksen surveys measures of real national income in various countries. 
He discusses three methods of measurement—deflation of current value 
estimates by a cost-of-living index, deflation of national product by sub- 
groups, and formation of a weighted average of production indexes. He 
also considers the more general valuation problems as exemplified by the 
well-known discussion (in Economica) between Hicks and Kuznets. A more 
interesting but difficult approach to social accounting in real terms would 
seem to be the preparation of a complete set of accounts in constant prices. 
This would entail the revaluation of both income and expenditure av. sunts 
simultaneously. Balancing items like savings and profits may thus bear the 
brunt of price adjustments of current value data. 

Reddaway is interested in extending the concept of an industrial produc- 
tion index to the whole economy; i.e., he would obtain a measure of real do- 
mestic production. He uses the United Kingdom as a concrete example for 
the presentation of his problems. He wants to measure net instead of gross 
real output and adopts Geary’s approach of striking the difference between 
output and input valued in constant prices for each industry. He recognizes 
the practical difficulty of carrying out this approach in every instance. 

The other group of papers draws on results of applications in various 
countries. The papers are “Experiences in the Use of Social Accounting in 
Public Policy in the United States” by Gerhard Colm with an appendix on 
“The Nation’s Economic Budget” by Mary W. Smelker, “Les Expériences 
Récentes de l’Emploi de la Comptabilité Sociale par la Politique Economique 
en France” by Jan Marczewski with an English summary, “The Recent 
Use of Social Accounting in the United Kingdom” by E. F. Jackson, and 
“Recent Experiences in the Use of Social Accounting in the Netherlands” 
by G. Stuvel. 


1 The resting account is absent. 
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Colm traces the history of development and uses of national income sta- 
tistics in the United States. He also discusses the structure of the nation’s 
economic budget appearing in the President’s economic reports and its 
usefulness in stabilization policy. Mrs. Smelker’s contribution deals with the 
technical problems of preparing the nation’s economic budget. 

Marczewski is concerned with the historical development of social 
accounting in France and a consideration of its influence on policy. An 
interesting attempt was made to ascertain the effects on policy by means of 
& survey posing questions to those responsible for policy. He also gives an 
analysis of recent French economic developments in terms of national 
accounts. 

Jackson describes the origins of social accounts in Britain, which were 
largely due to the insistence of Keynes, in contrast with the present informa- 
tion consisting of operating and appropriation accounts of firms, the revenue 
account of households,? the revenue account of public authorities, a combined 
capital account, and a rest-of-the-world account. He singles out perhaps the 
most important aspect of social accounting in relation to public policy by 
his remark, “It is unnecessary to enlarge on the value to clear thinking of the 
systematic presentation in a logical system of accounts of the past transac- 
tions of an economy.” Jackson does not try to read more into the double- 
entry accounting statements than is really contained therein. He shows 
how attempts are made to supplement the bare accounting records with an 
interrelated model of behavior and other relations. However, his model 
approach is rough and intuitive. 

Stuvel presents the excellent and easily understandable system of Dutch 
social accounts. Their system consists of four types of accounts, income, 
product, foreign, and capital. The economy is also divided into four sectors— 
personal, collective income recipients, business and public. This system of 
social accounts is widely used in Dutch central economic planning especially 
for drawing up a confrontation of economic needs by means over the plan 
period. The interesting thing about Stuvel’s paper is that he sees the function 
of the social accounts clearly to provide the balance equations of the economy, 
and with the idea of a complete econometric model before him argues that 
more fruitful results must rest upon the establishment of a link between the 
accounting equations and the structural equations of a complete model. 

The initial volume of the International Association has dealt interestingly 
with important problems and forms the groundwork for the development of 
a valuable new series. 





2 Revenue accounts are consolidations of operating and appropriation accounts. 
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Triple Audit of Industrial Relations. Dale Yoder, Herbert G. Heneman, Jr., 
and Earl F. Cheit. Bulletin 11, August 1951, Industrial Relations Center, Uni- 
versity of Minnesota. Minneapolis: University of Minnesota Press, 1951. Pp. vii, 
77. $1.00. 


GunNTHER T. STEINBERG, Western Electric Company 


HIS bulletin describes the background, point of view, and preliminary 

findings of a long-term and continuing series of industrial relations 
research studies carried on by the University of Minnesota Industrial 
Relations Center. This series of studies has been called the Triple Audit 
of Industrial Relations because it originally attempted to determine how 
the following three types of industrial relations factors in individual business 
firms are interrelated and how they conform to estabiished bench marks: 
(1) economic characteristics of employers, (2) industrial relations and 
personnel practices of employers, and (3) employee opinion and reaction. 
As the studies have progressed, however, the authors have given considera- 
tion to many other factors, both economic and non-economic in nature, 
that may have a significant infduence in employment relationships. Conse- 
quently, the authors feel that their studies may well be called the Multiple 
Audit of Industrial Relations. 

The authors go on to say that the basic objective of industrial relations 
research is to provide a description and explanation both of the process of 
labor allocation within and between business firms and of the factors that 
affect the utilization of manpower resources after the allocation has taken 
place. With this objective the reviewer is in hearty agreement, for the attain- 
ment of the objective may provide information that may not only enable 
economic theorists to think more realistically about labor as a factor of pro- 
duction, but may also aid individual business firms in the efficient allocation 
and utilization of manpower resources. However, as will be explained later, 
this reviewer is somewhat disturbed over some of the information that the 
authors have either considered or failed to consider in the process of working 
toward their laudable objective. 

To begin with, the bulletin consists of four parts: Part One, Industrial 
Relations Research and the Triple Audit; Part Two, Pilot Studies Showing 
Application of Triple Audit Research Design; Part Three, Findings from the 
Triple Audit Pilot Studies: 1948-1950; Part Four, Unanswered Questions 
for Future Research; Appendix A, Questionnaire on the Economic Charac- 
teristics of the Firm, Questionnaire on the Industrial Relations and Personnel 
Practices of the Firm, Questionnaire on Employee Opinion; and Appendix 
B, Triple Audit Sociometric Analysis. 

In Part I the theory and design of the Triple Audit studies is discussed. 
The authors maintain that industrial relations research, because of the 
nature of the human resource, will not be fruitful unless it is based on a 
set of principles somewhat similar to the following principles incorporated 
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in their Triple Audit research: (1) Consideration must be given to the con- 
tributions of all the social disciplines—economics, psychology, sociology, 
anthropology, etc.—and not of economics alone; (2) Studies of individual 
business firms must be so designed that their results can be integrated with 
the results obtained by other investigators, thereby permitting additive 
progress; (3) Whenever possible, industrial relations factors must be meas- 
ured in quantitative terms rather than in qualitative terms; and (4) Less 
emphasis must be placed on pure theorizing and more emphasis on empirical 
investigation and on verification of relationships between industrial relations 
factors. With these general principles the reviewer finds himself in basic 
agreement. More detailed information on the actual application of the 
Triple Audit research design to individual business firms is contained in 
Part II of the bulletin. Comprehensive Triple Audit studies were carried 
out over the two-year period 1948-1950 for only nine firms—two in trade 
and seven in manufacturing, and all of them in the Minneapolis-St. Paul 
area. With a list of all employees for each of the participating firms at hand, 
the firm samples (varying in size from about 50-200 cases) were determined 
by selecting every “nth” name on the various lists. The combined sample of 
the various firm samples covered approximately 1,200 employees, from whom 
employment relations information was collected by the use of various scales, 
check lists, questionnaires, interviews, etc. 

This employee information, together with that obtained from the indi- 
vidual business firms, was analyzed primarily by such statistical techniques 
as Student’s é-test, the chi-square test, correlation, standard error of the 
difference between two means, R. A. Fisher’s analysis of factorial designs, 
etc. The various hypotheses about industrial relations factors to be tested 
by these statistical techniques were chosen from the literature on the basis 
of their frequency in print, their importance, and their adaptability to 
statistical testing. Each hypothesis used in the studies related two particular 
factors that were regarded as significant in conditioning some characteristic 
of employment behavior. The various single relationships between a major 
industrial relations factor and several “contributing” factors were then 
grouped to show the gross effects of these “contributing” factors operating 
simultaneously on the major goal factor. Among the major factors selected 
for study in the Triple Audit investigations are the following: grievance 
ratios, labor turnover, profits, costs, scores on employee attitude scales, 
union security, take-home earnings, and hours of work. 

Part III of the bulletin presents the findings from the Triple Audit pilot 
studies. The first series of findings compared factors in from 5-7 manufac- 
turing firms. Here the firm was used as the basic unit of comparison because 
the relationships were between factors applying to a firm as a whole. Since 
the number of firms invoived was too small to show how the results obtained 
would be expected to differ from chance findings, no statistical tests were 
made of inter-firm comparisons. 

In the second series of findings the individual employee was used as the 
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basic unit of comparison irrespective of the particular firm from which he 
was selected. In every case for which results were given, individual employee 
attitude scores were compared with such individual employee characteristics 
as age, sex, number of years of service with the firm, number of years on 
the present job, etc. Since this group of findings was based on many individ- 
uals, the significance of the simple relationships was tested either by the 
use of the standard error of the difference between means or by the 
chi-square test. Observed relationships were stated to be “statistically 
significant” at or below the 5 per cent level of significance and “highly 
significant” at or below the 1 per cent level of significance. In the presenta- 
tion of the simple relationship between a pair of industrial relations factors, 
each factor was dichotomized. In some cases the division was on the basis 
of above and below median or mean values for all firms included in the 
survey. In other cases the basis for dichotomization was presence or absence 
of the factor in question. 

Part IV of the bulletin points up the fact that the authors are well aware 
of many of the shortcomings of their research in industrial relations. The 
reviewer, however, wishes to emphasize certain shortcomings that he con- 
siders particularly important. 

In the first place, the authors fail to obtain any information on the 
straight-time basic rate of pay by occupations in the case of all employees 
and the straight-time average hourly earnings by occupations in the case 
of workers operating under incentive payment plans. The only information 
on basic rates of pay and occupations that the authors obtain in their 
employee opinion questionnaire is that on take-home earnings (whatever 
that means) and such broad type-of-work classifications as sales, clerical, 
production, etc. If the basic objective of industrial relations research is 
to explain the process of labor allocation and utilization, as the authors 
have asserted, then the reviewer considers the individual basic rate of pay 
information by occupations as important. 

With this basic occupational information at hand, the investigator may 
then broaden the scope of his research by studying also such relationships 
as the extent to which the variation in basic rates of pay is explained by such 
individual employment relations factors as occupation, performance, edu- 
cation, length of service, age, sex, etc., and by such group and environmental 
factors as group performance, labor union activity, market position of the 
firm, employee benefits and services provided by the firm, etc. To ignore 
basic rates of pay by occupations is to ignore some of the most useful and 
practical aspects of industrial relations research. Furthermore, failure to 
consider wage rate information by occupations may make it impossible to 
interpret properly the observed relations between other important employ- 
ment relations factors, since the occupational distribution of employees as 
well as the rate-of-pay distribution of employees within an occupation may 
vary drastically both in intra-firm comparisons over a period of time and in 
inter-firm comparisons as of a moment in time. 
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Somewhat related to the above criticism is the fact that the authors have 
not incorporated any measure of employee productivity int« their studies. 
Failure to do so eliminates another whole area of useful and practical indus- 
trial relations research. The authors realize the importance of this omission 
and hope in the future to use sociometric techniques in order to obtain at 
least crude measures of employee productivity. Actually, such variables 
as job performance scores assigned by supervisors, the ratio of “should- 
take” hours (based on some standard) to actual hours to perform a specific 
phase of production, actual count of units of production, etc., are now avail- 
able to some extent in industry. It is the reviewer’s opinion that this type of 
information, related more or less to productivity, should be utilized to the 
utmost in those firms where it is found and should not be ignored, as the 
authors have done, just because comprehensive productivity data were not 
found in all the firms studied. 

Another criticism to be made centers on the fact that the authors fail to 
present any information on the partial or net relationships between industrial 
relations factors resulting from the simultaneous consideration of the inter- 
relationships among several factors. No reason was given for focusing atten- 
tion solely on the simple relationships between pairs of industrial relations 
factors. No doubt the paucity of data accounted for this in those cuses where 
the business firm was used as the basic unit of comparison, but the paucity- 
of-data argument cannot always be called upon to explain the lack of 
treatment of multiple relationships in the cases where the individual em- 
ployee was used as the basic unit of comparison. From the point of view of 
achieving the basic objective of industrial relations research, the important 
findings are not the simple relationships between factors but rather the 
multiple and partial relationships, for it is quite possible that a marked 
positive simple relationship between two factors may well change to a 
marked negative net relationship when several factors are considered 
simultaneously. 

An additional criticism of the Triple Audit research studies centers around 
the rather frequent lack of homogeneity and comparability with respect to 
the systems of classification and to the units of measurement employed. 
This criticism is in direct opposition to the authors’ assertion that their 
data are comparable as between firms in that they were gathered using an 
identical frame of reference and identical scales and measurements. For 
example, the authors apparently assume that factors such as the following 
have comparable significance for the same firm over a period of time and for 
a group of firms as of a moment in time: cost of goods sold, direct labor 
costs, indirect labor costs, common labor rates, break-even point as per cent 
of capecity, and book value of assets. Furthermore, in the classification of 
firms by various factors, the authors evidently consider the various degrees 
of competitive rivalry in the market of such small importance that they 
ignore it entirely. Moreover, with respect to certain scales, such as those 
related to employment stabilization and communication with employees, 
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the authors apparently assume that the mere use of a larger number of 
methods is indicative of a higher level on the scale, irrespective of the 
intensity with which any one method is employed. : 

There are other shortcomings of the Triple Audit studies that cannot 
be covered here. For example, the reviewer is convinced that much informa- 
tion is sacrificed by using class intervals that are too broad for some of the 
original data (earnings, hours, age, service, etc.) and by dichotomizing the 
various industrial relations factors in the presentation of the findings. 

In conclusion, the reviewer wishes to comment briefly on an impression 
given by the authors about the manner in which they assign significance 
and causality to various industrial relations factors on the basis of modern 
statistical tests of significance. The only thing that apparently keeps the 
authors from considering “significant” variations from chance as proof of 
causal relationships between industrial relations factors is the fact that the 
units of comparison were so few in number. Even if the number of firms 
was increased considerably, however, there would still be no justification 
for drawing such conclusions on the basis of significance tests alone, because 
of the shortcomings of the Triple Audit studies discussed above. In applying 
R. A. Fisher’s methods of analysis of variance and covariance, we cannot 
hope to emerge with useful results if the basic data to which these methods 
are applied have been obtained by elastic scales and units of measurement, 
are the result of non-uniform definitions and classifications, and can reason- 
ably be assumed (on the basis of information not included in the significance 
tests) to be influenced by other factors totally ignored in the study. 

Furthermore, in dealing with data on economics and industrial sociology, 
the assignment of causality to particular factors is generally considerably 
more hazardous than it is in such research as, say, agronomy. This is due to 
the fact that economics and social statistics generally do not arise from 
designed experiments, where the investigator actually applies treatments 
with the aid of the principles of randomization and replication so as to arrive 
at an unbiased and reliable estimate of sampling variance, but rather are 
generally placed in the format of an experimental design by the investigator 
after the interplay of the various factors in a particular situation has taken 
place. 

As a consequence of these conditions, information not considered in the 
tests of significance may at times be of such a nature that the investigator 
should give it so much weight as even to reverse the findings based on 
significance tests alone. This type of problem needs further consideration 
in the Triple Audit studies if findings therefrom are not to be misleading. 





340 AMERICAN STATISTICAL ASSOCIATION JOURNAL, JUNE 1952 


Unionism, Wage Trends, and Income Distribution, 1914-1947. Harold M. Levin- 
son. Ann Arbor: University of Michigan Press, 1951. Michigan Business Studies, 
Volume X, Number 4. Pp. 122. $2.00. 


Peter O. Sterner, University of California (Berkeley) 


HE explicit purpose of this monograph is to attempt by statistical investi- 
gation to test the validity of alternative theories about the effect of 
union action on wages, prices, and employment. Such an attempt could prove 
useful in any or all of the following ways: 
1. In providing a meaningful test of the alternative theories. 
2. In collecting or developing data that are themselves of great interest. 
3. In suggesting interesting hypotheses for further investigation. 
Mr. Levinson makes a substantial contribution in the second of these 
ways, and, to a lesser extent, in the third. But statisticians, particularly 
those concerned with developing even moderate statistical sophistication in 
students of economics, will be particularly disturbed by the failure of the 
author to accomplish the first objective because of an apparent lack of 
understanding of what is necessary for a meaningful structure of testing. 
They will be further disturbed by the nature of some of the inferences 
from empirical data that they find here. Because these failings seem to me 
to be exceedingly important and, sadly, not infrequent in economic em- 
pirical work, this review will focus heavily on them, even at the expense of 
failing to do justice to the wealth of important and useful data presented. 
Levinson identifies two classes of wage theories, the “bargaining” theories, 
and the “non-bargaining” theories.' The former “imply that union bargaining 
power may raise wage rates within a significant range without causing some 
type of price-employment reaction on the part of the producer” (page 17); 
the latter imply the reverse. “The issue is not whether unionism can raise 
wages, although this may be a necessary first step, but rather, whether 
wage rates can be raised without affecting prices, output, and employment” 
(p. 19.) The major issue for testing is stated a few sentences later “Wide- 
spread unionism, therefore, assuming the bargaining theory to be generally 
applicable, would result in significant shifts in the distribution of the national 
income in favor of labor and at the expense of profits, both in the individual 
industries which are unionized and in the economy as a whole” (p. 20.) 
This proposition is tested by comparing functional distributive shares of 
private national income over the period of 1919-1947. This examination 
leads Mr. Levinson to conclude “In view of the fact that the evidence discussed 





1 The bargaining theories are represented by the anti-marginalists, specifically Hall and Hitch, and 
by the textbook development of Lloyd Reynolds. The non-bargaining theories are represented by the 
classical marginal productivity theory and by the “markup” theory (again Hall and Hitch). The range 
of theories and the examples of each seem unfortunately narrow. It was surprising to this reviewer to 
find, for example, that the most sophisticated of the bargaining theories—as developed by Ross in Trade 
Union Wage Policy—was not included. Similarly, the only awareness I found of the work of Dunlop 
and Reder was in the bibliography. 
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above indicated that there was no significant reduction in the share of income 
going to profits, despite extensive increases in money wages, the conclusion is 
clear that the marginal productivity and markup theories appear to offer a far 
more accurate representation of the actual functioning of the labor and product 
markets than do the various versions of the bargaining theory” (p. 115, italics 
mine). 

Quite apart from the validity or adequacy of the major issue, the notion 
that the historical comparison of distributive shares provides evidence 
about the validity of the theories is quite naive. It is one thing to reason 
about the effect, ceteris paribus, on income distribution of the ability of 
unions to “raise wage rates within a significant range without causing 
some type of price-employment reaction on the part of the producer.” 
It is a totally different thing to argue that achieved changes in distributive 
shares provide evidence as to the results that might have occurred had 
ceteris paribus been a likely assumption; certainly for the period of 1919- 
1947 the assumption seems questionable. Indeed in view of some of the other 
variables that affect prices and profits—monetary policy, taxes, changes 
in the methods and composition of production, to mention just a few—I 
can think of no showing of the data on income shares that would not be 
consistent with either of the alternative kinds of theory presented. 

Fortunately almost half the monograph is concerned with presenting 
and analyzing data on the “Influence of Unionism on Wage Trends.” Data 
are presented in considerable detail for both unionized and non-unionized 
groups for the period 1914-1947, and are helpfully classified according to 
levels of wages and according to industries. These data go far beyond those 
generally available previously and are themselves quite sufficient reason 
for publication of the monograph. Further, Mr. Levinson’s analysis of this 
material is suggestive. His general view is that unionism can affect the level 
of wages subject to the influence of the level of employment and the govern- 
mental attitude, direct and indirect, toward labor. His view with respect to 
periods of high employment—that at such a level employer pressure will 
bid up wages with or without union pressure—seems particularly helpful 
in interpreting data of the kind recently developed by Rees for the coal and 
steel industries in the the postwar period. 

The less fortunate part of this analysis of the influence of unionism is 
perhaps partly stylistic. Levinson’s “conclusions” seem te me rather in the 
nature of hypotheses. He has the unfortunate habit of gliding from correla- 
tion to causation, from consistency of data with certain propositions to 
proof of validity of the propositions. Substantively these are perhaps not 
important matters—one will in any case treat “conclusions” as hypotheses 
with respect to further data; but they do seem important to those concerned 
with graduate instruction (the monograph is developed from a doctoral 
dissertation submitted in 1950). Both the formulation and the testing of 
hypotheses by the analysis of empirical data are highly useful endeavors; 
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realization of the full usefulness of such efforts requires however a greater 
degree of understanding of the processes of testing and of inferring from 
empirical data than is shown in this monograph. 


A Proposed System for Classifying Manufacturing Concerns by Size. James I. 
Mills. Washington: U. S. Department of Commerce and National Production 
Authority, June 22, 1951. Pp. iv, 90. Paper. 


G1pEoN RosENnBLUTH, National Bureau of Economic Research 


HE main object of this report is to develop criteria for determining which 

manufacturing firms are to be classified as “small business.” These criteria 
are needed to administer laws in which small business is singled out for 
special treatment, such as the Selective Service Act of 1948 and the Defense 
Production Act of 1950. 

The result, based on the 1947 Census of Manufactures, is really a system 
for classifying plants, using employment as the measure of size. (Firms 
with more than one plant are to be classified in the same way as a single 
independent plant having the same number of employees.) A plant classified 
as “small” may range from one with less than 50 employees to one with less 
than 2,500 employees, depending on its industrial classification. The upper 
limit of small size applicable to a given industry is defined as a size of plant 
such that one-third of the industry’s “value-added by manufacture” is 
produced in smaller plants. For industries in which the upper limit would be 
less than 250 employees the “one-third” rule is modified by substituting an 
increasing proportion as the upper limit declines. 

The reasoning on which these rules are based appears to be as follows: 
It is assumed to be the intention of Congress that “small business” should 
include both the firms that are small in some “absolute” sense, and those 
that are small in relation to other firms in the same industry, though they 
may be large in relation to firms in another industry. It should, moreover, 
exclude firms that are large in some absolute sense. The inclusion of all 
firms with less than 50 employees and exclusion of all those with over 
2,500 are intended to take care of the “absolutely small” and the “abso- 
lutely large” firms. The “one-third” rule and its modifications are intended 
to locate the firms that are “relatively” small. 

The figure of one-third is based on the fact that for manufacturing as a 
whole “value-added” per employee increases with size of plant, and when 
plants are arrayed in order of size, one-third of “value-added” comes from 
plants with less than average productivity in this sense. The explicit reason- 
ing by which the rules are derived from this finding is as follows: “On an 
industry basis, there is considerable variation in the proportion of total out- 
put coming from establishments in this category. Nevertheless, it has been 
assumed for purposes of analysis that one-third of the total industry output 
comes from small establishments except in those industries in which estab- 
lishments with considerably less than 250 employees do much more than 
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one-third of the manufacturing” (pp. 4-5). Further assumptions implicit in 
the reasoning and stated elsewhere in the bulletin are that value-added per 
employee measures efficiency and that it is the sense of Congress that special 
assistance should be given to small firms on account of their low efficiency. 

The author has tackled a problem which involves two almost overwhelm- 
ing difficulties. First, congressional intent regarding the characteristics of 
“small business” is very hard to fathom. Secondly, we do not know enough 
about the industrial structure and its functioning to implement any given 
interpretation of the will of Congress with much confidence. There is there- 
fore room for a great deal of reasonable disagreement, and, for the purpose 
of this review, I shall not quarrel with the author’s interpretation of con- 
gressional intent, nor with his determination of large and small plants in the 
“absolute” sense. I assume, however, that the congressional concern with 
small business involves, in part at least, a desire to foster a competitive in- 
dustrial structure; and that in a set of working rules designed to be explained 
to, and implemented by, people who are not specialized statisticians, sim- 
plicity is a merit. 

From this point of view the proposed rules for determining “relatively” 
small size are open to serious criticism. The simplest concept of relative size 
is size of the firm in relation to size of the industry, or to size of other firms 
in the industry. This would suggest the simple rule that a firm is to be clas- 
sified as “small” if it individually accounts for less than a given percentage 
of the industry’s employment. According to the proposed system, however, 
a plant is small if the total proportion of the industry’s value-added that is 
accounted for by all the plants that are no larger than the plant in question 
is less than a certain value (e.g., one-third); no consideration is given to the 
number of plants involved. Hence, when the same rule is applied to an in- 
dustry with a few plants and one with many plants, the relative size of the 
individual small plants in the former will be greater than in the latter. For 
example, a small plant is defined as one with less than 1,000 employees in 
the “files” industry and as one with less than 1,100 employees in “ship build- 
ing and repairing,” on the basis of the “one-third” rule. This means that a 
“small” plant can have up to 22.5 per cent of the industry’s total employ- 
ment in “files” (where there are 37 plants) but only up to 0.8 per cent in ship 
building (where there are 301 plants). (Sources are Table I of the report, and 
the Census of Manufactures, 1947, Vol. I, Ch. I, Table 2.) 

It is difficult to see why a plant with 999 employees which accounts for 
over one-fifth of an industry’s employment should be classified as “small.” 
If a complex method leading to such results is preferred to the simpler 
method mentioned above, adequate reasons should be given, but this is not 
done in the report. Moreover, it is the simpler concept which is directly re- 
lated to the number of firms, and hence to the competitive structure of the 
industry. 

The use made of “value-added” per employee to locate the upper limit of 
the “relatively” small plant must also be questioned, even if we accept the 
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assumption that Congress intended to protect small firms against the results 
of inefficiency. 

First, it is assumed that value-added per employee can serve as an index 
of efficiency, i.e., that it is inversely related to cost of production per unit of 
output when comparing plants of different size producing the same product. 
That there is little reason to assume this a priori can be seen by writing down 
the definitional components of value-added per employee and cost per unit 
of output. If there is empirical evidence demonstrating such a correlation, 
it has not been publicized. Even if profitability is considered instead of 
efficiency (though this does not appear to be the case in this report) there is 
little reason to expect a strong correlation. 

Second, it is assumed that value-added per employee rises with plant size 
in all industries. That this is not even approximately true can be seen by 
glancing at the plant size distributions in the 1947 Census of Manufactures. 
For example, in the apparel group value-added per employee falls from 
$8,627 in plants employing 1-4 workers to $3,121 in plants of 2,500 or more. 

Third, even for those industries in which value-added per employee does 
rise with plant size, the relative point at which it is equal to the average for 
the industry is widely dispersed, as the author himself mentions. There is 
no warrant for the assumption that in individual industries it occurs where 
one-third of value-added is in smaller plants. 

One is forced to the conclusion that the system of classification proposed in 
this bulletin is of a complexity that is not justified by the soundness of the 
assumptions and reasoning on which it is based. 


The report contains a voluminous and interesting appendix dealing with 
the number of plants per firm and the relationship between products and 
industries. 


The Malthusian Controversy. Kenneth Smith. London: Routledge and Kegan 
Paul, 1951. Pp. vii, 350. 30 s. 


George J. Sticuer, Columbia University 


ALTHUS, fortunate man, had scarce any acquaintances: everyone was 

friend or foe. Today he still has many friends but they seem seldom to 
be acquainted with the Essay on Population, so he has a fame through dis- 
tance and a reputation founded upon error. This confers upon him an in- 
vulnerability that is as infuriating as his doctrines to his critics. 

Smith, a, trenchant critic, reviews the controversy over the Essay 
during Malthus’ life. He awards almost all the points to the adversaries, and 
believes that it was an unfair public that awarded the contest to Malthus. 
So do I. The very literature that Smith reviews, however, helps to ex- 
plain Malthus’ success. Controversy is perhaps the only path to fame open 
to a mediocre mind. Malthus was very fortunate in his critics: they were 
numerous enough to publicize his doctrine and to get in each other’s way; 
and they were weak controversialists. They frequently used scurrility—a 
poor weapon for any but the greatest debater, who does not need it—and 
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they criticized every passage, which is foolish even when it is justified. 
Smith shares this latter fault: he never for a page conceals his dislike for 
Malthus—who is not likeable—and this reduces his effectiveness. 

The first half of Smith’s book is devoted to summaries of the doc- 
trines of the critics, and of Malthus’ occasional replies. One must agree that 
many penetrating criticisms were made, and not adequately answered, but 
on the whole I am less impressed than Smith with the quality of this litera- 
ture. The empirical and moral issues were confounded beyond repair: this 
was perhaps inevitable when the central argument turned on checks to 
population called misery, vice, and moral restraint. The comprehensive, 
almost indiscriminate, summaries contain innumerable wholly superficial 
arguments; for example, Thomas Jarrold wrote: 


It is reversing the order established in the world, to increase the quantity 
of subsistence, under an idea that at some future period children will be 
born to consume it. In place of saying, population increases where sub- 
sistence increases, it would be more correct to say, subsistence increases 
because population increases, but this would be to destroy the doctrine Mr. 
M. labours with so much pains to establish (pp. 58-59). 


The force of the criticisms is also weakened by the mode of presentation: 
one meets an argument—or its contradiction—in chapter after chapter, with 
seldom a remark from Smith (unless Malthus is speaking). The defenders 
of Malthus are not included in the survey. 

The second half of the book is devoted to an analysis of the Essay and of 
applications of its theories to the doctrine of perfectibility, poor laws, wages, 
emigration, and birth control. Here some of the contemporary criticisms are 
more systematically reviewed, and supplemented by others of Smith. The 
critical analysis is persuasive, and certainly the main conclusions of the 
Malthusian theory are shown to be invalid. The only serious objection to 
this section is that Smith, overwhelmed by the riches of targets, disputes 
many points and emphasizes none: for example, about equal space is given 
to technological progress (pp. 230-32) and to Malthus’ conjectures about 
North American Indians (pp. 257-59). The chapters on applications are use- 
ful summaries, but I wish he had gone outside the range of the critics (thus 
even Ricardo is not discussed). 

Malthus concluded his Essay on a high note: “I shall always be quite 
ready to blot out any part of the work which is considered by a competent 
tribunal as having a tendency ... to impede the progress of truth.” The 
tribunal should be formed; we now have a prosecuting attorney. 


Demographic Yearbook, 1949-50. New York: Population Division and Statistical 
Office, United Nations, 1950. Pp. 558. $6.00. 


Frank Lorimer, American University 


7 second issue of the United Nations Demographic Yearbook carries for- 
ward the most comprehensive series of official data on population ever 
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assembled for publication. Some topics treated in the first issue (life tables 
and death rates by age and sex, and distribution of population by rural and 
urban classification and by size of locality) are omitted here in order to pro- 
vide space for additional tables on marriage and fertility, selected for special 
emphasis in this issue. Other topics will be selected for special emphasis in 
subsequent issues (e.g. mortality, in 1951). The successive issues are, in part, 
complementary in coverage. 

This issue contains an important new feature: a series of estimates on 
world population trends, by broad regions, 1920-1949, prepared by the 
secretariat. In preparing these estimates it was possible to escape from the 
limitation of strict adherence to official data imposed on the treatment of 
national statistics. The estimates of regional trends are, of course, fallible; 
but sources of possible error are described in the accompaning text. How- 
ever, they meet a real need, and it is doubtful that much increase in ac- 
curacy could be obtained merely by more exhaustive research in the absence 
of more reliable basic data. 

Apart from these estimates, the material presented should be understood 
as an exposition of official data, rather than as exposition of demographic 
characteristics. This qualification applies to all tables. For example, the table 
on “Number of Live Births” should properly be entitled “Number of Re- 
ported Live Births’—the latter number being in some cases radically 
different from the former. The text provides some partial correctives for this 
limitation, through a classification of current official estimates of total 
population by type, according to the nature of the source used in their 
preparation, and by a series of qualifying footnotes. However, it must be 
observed that because these notes are subject to official approval, absence of 
notation and in some cases the actual notations may be misleading (e.g. as 
regards the completeness of birth registration in India). 

This immensely useful compendium bears witness not only to the com- 
petence of the United Nations demographic and statistical staffs, but also 
to the cooperation of statesmen and civil servants of many governments in 
an international undertaking. The paucity of demographic data for the 
Soviet Union reveals the most unfortunate breach in the general pattern of 
constructive international action. The scope of the volume as compared 
with earlier handbooks under different auspices is evidence of progress in 
many countries toward more adequate statistical systems though the un- 
fulfilled needs in this respect may appear even more obvious. 


Cost of Medical Care: The Expenditures for Medical Care of 455 Families in the 
San Francisco Bay Area, 1947-1948. Emily H. Huntington. Berkeley and Los 
Angeles: University of California Press, 1951. Pp. xvi, 146. $2.50. 


Mara@aret G. Rei, University of Chicago 


HIS study was made under the auspices of the Heller Committee for Re- 
search in Social Economics to determine “the burden of medical bills on 
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a group of families which represent a “moderate income” level during the 
period 1947-48 (p. vii). It supplements the study made two decades earlier 
by the Committee on the Costs of Medical Care. In order to increase the 
homogeneity of the group and the validity of conclusions from a small 
sample the study was confined to families of milk wagon drivers, grocery 
clerks, and painters rather than including families of all wage-earners. These 
occupations were chosen because of their size and of the fact that family in- 
come of these groups fell in the middle range. A random sample of each oc- 
cupation was drawn from membership lists of male members of labor unions. 
The study was restricted to families with husbands and wives who had lived 
together throughout the entire year. Reports of 455 families were included. 
In the sample drawn to secure these there were 90 families who refused to 
cooperate or supplied inadequate information so that their schedules were 
not included. In addition a large number of families of persons whose names 
were drawn were never contacted. Such gaps in the sample of studies of this 
type are not unusual. Huntington states that “the possible biasing effect of 
the refusals in the final results cannot be measured, but it is not believed 
to be significant.” 

Families reported illnesses and medical and dental services and charges 
for these for each family member and expenditures for all types of health 
insurance. The study by the C.C.M.C. during the late twenties reported 
that families in California and throughout the nation spent “4 or 5 per cent 
of their income for medical care,” whereas this study reports 7.5 per cent 
(p. 92). No attention is given to the fact that no such shift is reflected in the 
annual income and expenditure data for the United States as a whole as 
estimated by the U.S. Department of Commerce. During 1929, expenditures 
for medical and dental care were 3.7 per cent of personal disposal income and 
during 1947 and 1948, 4.1 per cent. 

This study, in common with other studies, shows that expenditures for 
medical care are very unequally distributed. It is unique, however, in the 
attention given to prepayment plans and their bearing in the reduction of 
excessive costs. The conclusion is reached that although 60 per cent of the 
families reported one or more prepayment plan (p. 82) these were in general 
very inadequate. Huntington concludes that “unless the provisions of the 
prepayment plans available to these families are drastically changed, it is 
unlikely that anything approaching membership by the total group and 
protection against the full costs of medical bills can be expected in the 
future” (p. 105). She does not spell out in any detail the nature of the 
modification in prepayment plans that are needed nor the difficulties likely 
to be encountered in developing these and extending their use. Nor does the 
study contribute to our knowledge of how the “burden” of medical care is 
met—how, for example, families meet the bills which are equal in any one 
year to a half their incomes. 

The report includes a copy of the schedule and comparative data from 
other studies. 
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Social and Psychological Factors Affecting Fertility., Vol. Two. P. K. Whelpton 
and Clyde V. Kiser, Editors. New York: Milbank Memorial Fund, 1950. Pp. vi, 
328. $1.00. Paper. 


A. J. Jarre, Columbia University 


HE historical downward trend in the birth rate in the U. S. and in other 
"T industaialioed countries is largely a function of “changes in human values 
and modes of life inherent in the transition from rural and agricultural to 
urban and industrial economies” (p. 120). Since this declining fertility 
has been chiefly voluntary it seemed highly appropriate to investigate the 
individual motivation and attitudes toward size of family. With this in 
mind the Study of Social and Psychological Factors Affecting Fertility was 
planned before World War II; the data were collected from a sample of the 
Indianapolis population between April 1941 and January 1942. Since then 
a number of articles by various students have been published in the Milbank 
Memorial Fund Quarterly. The first three of these articles were subsequently 
reproduced as Volume One, Social and Psychological Factors Affecting Fertility 
and issued in 1946. 

Volume Two consists of seven articles which appeared in the Quarterly 
at various times between October 1945 and July 1950. These seven articles 
were prepared by P. K. Whelpton, Clyde V. Kiser, Robert B. Reed, and 
Ronald Freedman. No attempt was made in either volume to integrate the 
articles; rather it is planned to prepare an integrated report on the Study 
after the series of individual articles have been completed. Accordingly, 
Volume Two cannot be reviewed as a unified whole. 

The data were obtained by means of lengthy and intensive interviews. 
The Study concentrated “on couples who had at least tried to plan size of 
family to some extent” (p. 155). Accordingly, questions were asked con- 
cerning such topics as the use of contraception, attitudes toward sex of the 
children, attitudes toward the number of children desired, degree of marital 
adjustment, etc. Because of the lengthy interviewing, the costs per interview 
were large, and the number of interviews had to be minimized. This, in turn, 
meant that a sample adequate to represent all segments of the population 
would be far too expensive. Accordingly, a final sample of some 2,000 cases 
was designed to provide information only about families in which the hus- 
band and wife were native white; both Protestant; married in 1927, 1928, 
or 1929; wife under age 30 and husband under 40 at time of marriage; 
neither previously married; residents of a large city most of the time since 
marriage; and both elementary school graduates. From the nature of the 
primary emphasis of the study as previously quoted, and from the nature 
of the sample, it seems that comparatively little attention was paid to the 
factors working for and against family planning as such. 

Some of the analysis can be described in terms of the titles which appear 
in this volume, as for example, “The Interrelationship of Marital Adjust- 
ment, Fertility Control, and Size of Family,” and “Fertility Planning and 
Fertility Rates by Religious Interest and Denomination.” Without attempt- 
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ing to state the detailed findings of either of these two studies we may note 
that both are handicapped by lack of sufficient cases. The number of vari- 
ables which had to be considered was so great that the size of the sample 
was insufficient for the number of cross-tabulations needed. For example, a 
decline in marital adjustment with increasing family size was noted; because 
of the small number of cases, additional control factors could be brought 
into the analysis only with difficulty, if at all. “High economic status... 
has been shown to be associated with low fertility. It is also associated to 
some extent with good marital adjustment and probably with success in 
controlling fertility. Thus, differences in economic status or some similar 
variable may account, in part, for the relationships found between marital 
adjustment, size of family and fertility control” (p. 301). 

The art, or science, of attitude investigation, has made remarkable progress 
in the last ten years. It should be remembered that this study was designed 
over ten years ago, and many of the analyses now available in print were 
made five or more years ago. The most recently developed methods for 
analyzing attitude data have, therefore, not always been employed. It 
would appear, for example, that some recently-developed form of scaling 
could have been used successfully in allocating the families along a continuum 
of marital adjustment. 

From the historical perspective, this study must be considered of great 
significance, despite its limitations. It is a pioneering study in one of the 
most complex and difficult fields in the social sciences. Future investigations 
in this area will benefit greatly from the experiences of this study. But 
even future investigations, no matter how well planned, will be partly 
outmoded by the time they are completed. 


Demography. Peter R. Cox (Member, Institute of Actuaries and Faculty of 
Actuaries, Great Britain). Cambridge, England: Cambridge University Press, 
1950. Pp. xii, 326. 


Frank Lorimer, The American University 


HIS book presents a series of lectures on demography to British actuarial 
y proton Its limitation to a particular professional group accounts, at 
least in part, for an almost exclusive reliance on British sources. Projections 
of British population by Notestein and associates are treated, but with only 
meagre exposition of the theory and methodology of these projections. The 
treatment of Lotka’s contributions to demography is drawn entirely from one 
article by a British actuary. The theoretical implications of these contribu- 
tions are not adequately developed, though some of its inherent limitations 
are properly recognized in the chapter on “The Statistical Investigation of 
Fertility.” The section on “Demography Abroad” contains only two brief 
paragraphs. 
The exposition is generally succinct and accurate though occasionally, as 
in the description of the gross reproduction rate, the language is awkward 





350 AMERICAN STATISTICAL ASSOCIATION JOURNAL, JUNE 1952 


if not misleading. One of the useful features of the book is its detailed and 
critical treatment of official sources of basic data on population in Great 
Britain. 


Selected References on Test Construction, Mental Test Theory, and Statistics, 
1929-1949. Howard W. Goheen and Samuel Kavruck (Test Development Section 
of the United States Civil Service Commission). Washington: Government 
Printing Office, 1950. Pp. xii, 209. Superintendent of Documents. $1.50. Paper. 


WarkEN S. TorGerson, Social Science Research Council 


7 is a bibliography designed to aid the test technician engaged in 
selection and placement activities. The authors state that the bibliog- 
raphy has been “developed, little by little, as the need for various parts of 
it has arisen in the Test Development Section.” 

Except for a few outstanding earlier articles, the references were selected 
from papers appearing between 1929 and 1949. The references cited are 
primarily papers on methodology rather than on the construction or uses of 
specific tests. Papers on statistics included are limited to those “pertinent 
to the field of test construction and allied research.” 

The authors state that “no attempt has been made to evaluate the 
references selected other than for their pertinence or bearing on a particular 
problem or on some specific aspect of broader problems.” While they have 
“had occasion in their work to refer to hundreds of articles appearing in the 
bibliography,” there are also “hundreds of listings which they have not had 
occasion or opportunity to read.” 

The book is divided into six parts: a short section on how to use the book; 
a rather good table of contents; the bibliography itself; a list of periodical 
abbreviations with the complete titles; an author index; and a subject-matter 
index. The bibliography is divided in 14 main sections, which are further 
divided into subsections. References are arranged alphabetically by author 
within each subsection but are numbered cumulatively. Cross references 
are given at the end of each subsection. 

The section headings will serve to indicate the breadth of coverage: 
Experimental design; Job analysis; Criteria and rating scale methods; 
Test construction theory; Problems relative to test construction in specific 
areas; Problems relative to test construction of specific types; Problems of 
administration; Realibility; Validity and prediction efficiency; Statistical 
theory; Short-cut methods, graphs, charts, and tables; Oral examinations 
and selection interview techniques; Attitude-opinion methodology; and 
Bibliographies. 

A few rather specific criticisms might be mentioned. The subsection 
entitled “Item analysis and item construction” would better be called 
simply “Item analysis,” since it is almost entirely devoted to this topic while 
most of the papers dealing with construction of items appear in other sec- 
tions. With respect to placing of papers under particular categories, there is 
inevitably room for disagreement. Three papers, however, were noticed 
which seem definitely out of place. Two articles by Baxter on factorial design 
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somehow found their way into the subsection on factor analysis. The fourth 
yolume of the American Soldier series (Measurement and Prediction), which 
is entirely devoted to attitude-opinion methodology, is placed among the 
general references to test construction theory. A number of typographical 
errors were found but none were serious enough to cause any misunderstand- 
ing. Incidentally, the bibliography does not contain 2,544 different titles, 
since several titles are listed twice. 

The coverage seems to be quite complete, particularly in those areas 
central to the purpose of the book. Those concerned with selection, place- 
ment, and evaluation procedures will find the book very useful. 


Statistische Urteilsbildung. H. Gebelein and H. J. Heite. Berlin: Julius Springer, 
1951. Pp. xvi, 192. Paper. DM 15.60. 


H. B. Mann, Ohio State University 


ye book is written for medical and biological researchers and contains a 
selection of statistical methods which the authors consider useful in these 
sciences. The first twelve chapters of the book are devoted to descriptive 
statistics. In these chapters the authors describe methods for computing the 
sample values of mean, variance, skewness, correlation coefficients and so 
forth. The Gaussian distribution is discussed and the distribution of a quan- 
tity z such that In(1+ cz), (c fixed) is normally distributed with mean 0. 
The latter distribution is termed the normal distribution of the second kind 
and the authors assert that it is frequently encountered in medical and 
biological data. The first twelve chapters use statistics and sample distribu- 
tions merely as a method for summarizing information contained in the 
sample. Only in Chapters XIII and XIV is there an indication that they 
may be used for the purpose of testing hypotheses. In Chapter XIII only 
the x? and ¢ tests are discussed. The F test is mentioned in small print only 
and its use is not discussed. Chapter XIV deals with the testing of class 
frequencies. Some of the tests recommended in this chapter mystify the re- 
viewer and he can only venture a guess that they are obtained by means of 
the Bayes theorem under some assumption about the a priori distribution 
of the parameters. Chapter XV gives a short and rather incomplete discus- 
sion of confidence intervals. For interval estimates of unknown probabil- 
ities an approximation is recommended which the authors call “Mutungs- 
grenzen.” Chapter XVI discusses the testing of contingency tables with 
special emphasis on small class frequencies. 

Since the authors nowhere indicate the principles of statistical inference 
which lead them to their test procedures, the book must be considered as a 
book on statistical method only. As such it does not give sufficient space to 
tests of hypotheses and interva! estimation and overemphasizes purely 
descriptive procedures. The usefulness of this book for the study of statistical 
methods is made even more doubtful by the absence of adequate tables. 
Even tables of the x? and ¢ distributions are missing. The analysis of variance 
is completely ignored, 
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